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REAL PARTY IN INTEREST 

The real party in interest is Sunol Molecular Corporation of Miramar, Florida. An 
assignment from the inventors to Dade International was recorded on August 18, 1997 at Reel/ 
Frame 8681/0081. An assignment from Dade International to Sunol Molecular Corporation is 
being filed on today's date under separate cover. 

RELATED APPEALS AND INTERFERENCES 

There are no related appeals or interferences known to Appellants or Appellants' 
representatives that will directly affect or be directly affected by or have a bearing on the Board's 
decision in the pending Appeal. 

STATUS OF THE CLAIMS 

Claims 1, 2, 4, 7, 8, 14, 67, 69, 71 and 72 stand finally rejected under 35 U.S.C. §103 and 
are pending on appeal. Claims 5, 6, 10-12, 16-20, 61, 65, 70, 73 and 74 have been cancelled. 
Claims 3, 9, 13, 15, 21-60, 62-64, 66 and 68 were withdrawn from consideration by the 
Examiner. 

STATUS OF THE AMENDMENTS 

Claims 1, 2, 4, 7, 8, 14, 67, 69, 71 and 72 on appeal are set forth in Exhibit A hereto. 
There has been no further amendment to the claims. 

SUMMARY OF THE CLAIMED INVENTION 

The claimed invention features a soluble fusion protein engineered to include a 
bacteriophage coat protein fused to a single-chain T cell receptor ("scTCR"). The single-chain T 
cell receptor was itself designed to include an alpha-variable region ("V-ct") fused to a beta- 
variable region (" V-P"). The single-chain T cell receptor forms a pocket that binds antigen when 
the antigen. The claimed soluble fusion protein further includes a beta-constant region ("C-P") 
region that can be fused to V-p, for example. 
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T cells help defend the body against infection. The cells have membrane bound receptors 
that bind foreign antigen with the assistance of a protein complex called "MHC". A key receptor 
is called the T cell receptor ("TCR"). The chemical structure and function of the TCR has been 
extensively studied. For instance, it is known that formation of a TCR-antigen-MHC complex is 
an important step toward fighting infection. 

Appellants discovered that by adding a bacteriophage coat protein to the scTCR, it is 
possible to produce a fully soluble and functional scTCR. Unlike prior scTCRs, the claimed 
fusion proteins were found to be fully soluble, functional, and obtainable in significant quantities 
without difficulty. The claimed fusion proteins have a wide spectrum of important uses as 
described throughout the instant patent application. 

See the Summary Of The Invention at pg. 3, line 22 to pg. 11, line 21. See also the 
Background at pg. 1, line 12 to pg. 3, line 19 for related information. None of the art of record in 
this case shows an attempt to make a scTCR that includes a fused bacteriophage coat protein. 

ISSUE 

This appeal presents the issue of whether the Examiner erred in rejecting claims 1, 2, 4, 7, 
8, 14, 67, 69, 71 and 72 under 35 U.S.C. § 103 in view of Chung, S. et al. (1994) Proa Natl 
Acad. Set (USA) 91: 12654 in view of U.S Pat. No. 5,759,817 to Barbas, Onda, T et al. (1996) 
Mol Immunol 32: 1387; and Huse et al. (1992) J. Immunol 149: 3914. Appellants will refer to 
these citations as "Chung", "Barbas", "Onda" and "Huse", respectively; unless stated otherwise. 
There are no other pending rejections of record in this case. 

GROUPING OF THE CLAIMS 

All of claims 1, 2, 4, 7, 8, 14, 67, 71 and 72 stand or fall together for the purpose of the 
present appeal. 

CASE HISTORY 
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In consideration of the Examiner's position in this case, Appellants have summarized the 
prosecution history with respect to the 35 U.S.C. §103 rejection at issue. A more detailed 
discussion of Appellants' rebuttal to those arguments will follow under Argument . 

A. A patent application was filed on March 7, 1997 with 59 claims and assigned 
Serial No.: 08/813,781 by the USPTO. 

B. An Office Action was mailed to Appellants on August 25, 2000 by Examiner 
Schwadron. The Examiner stated the following with respect to the §103 rejection on appeal: 



Claims 1, 2, 4, 7, 8, 14, 67, 69, 71, 72 are rejected under 35 U.S.C. 103(a) 
as unpatentable over Chung et al. in view of Barbas US 5,759,817 (filed Jan. 27, 
1992), Onda et al. (Molecular Immunology 32:1387, 1995), and Huse et al. J. 
Immunology 149:3914. 1992 

Chung et al. teaches a single chain T cell receptor which specifically binds 
to peptide ligand (see abstract). Chung et al. further teaches one embodiment of 
human single chain TCR in which C-terminus of V a domain is linked to N- 
terminus of V P chain via a 15 amino acid residue flexible amino acid liner and 
the C-terminus of the V p chain is linked to the beta chain constant domain (see 
Figure 1). In one embodiment the C terminus of V p chain is linked to a alkaline 
phosphatase (PI) protein tag (see page 12655). Chung et also teach that the 
purpose of the linker is to enhance the binding characteristics of the soluble T cell 
receptor and that linkers of about 10 to 30 amino acid residues would be 
considered to be sufficient. Chung et al. teach that the TCR fusion protein can 
bind antigenic protein, thus teaching that the TCR fusion protein comprises an 
antigen binding pocket. Chung et al. teaches a TCR fusion protein comprising V- 
a-peptide linker-V P-C P linked to GPI anchor and expression of such a fusion 
protein in a transfected eukaryotic cell (see results section). Chung et al. disclose 
that the soluble form of TCR protein could be readily obtained by enzymatic 
cleavage with phosphatidylinostol-specific phospholipase C (PI-PLC) (see page 
12656). Chung et al. teaches expression of said TCR fusion protein in a bacterial 
cell system in which the N terminus of the CP region is linked to a histidine 
protein tag. Chung et al. also disclose a scTCR in which comprises V-a-peptide 
linker-V p-C p GPI in which the Cp component consists of the p chain sequence 
ending right before the last cysteine (the sixth cysteine) (see page 12655). Chung 
et al. further teach that TCR fusion proteins which do not contain the CB do not 
fold into the native conformation. The scTCR disclosed by Chung et al. meet the 
length limitations of the Va. and Vp region recited in claims 69 and 71. Chung et 
al. teach a soluble fusion protein comprising a Va-peptide linker-Vp-Cp 
fragment-protein tag (eg. GPI). Chung et al. does not teach a TCR fusion 
protein further comprising bacteriophage VIII coat protein. 
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However, Barbas discloses a soluble fusion protein comprising a 
bacteriophage coat protein fragment covalently linked to a single-chain 
heterodimeric receptor (see abstract and column 15, lines 27-28, in particular). 
Barbas also discloses that the fusion protein may comprise domains of 
heterodimeric proteins derived from several ligand binding proteins, including 
immunoglobulins and T cell receptors (see column 17, lines 62-66 and column 19, 
lines, 9-28. Barbas discloses that T cell receptor comprises alpha and beta chains 
each having a variable(V) and constant(C) region and T cell receptor has 
similarities in genetic organization and function to immunoglobulins (see column 
19, lines 19-22, in particular). Barbas also teaches that bacteriophage coat protein 
may be derived from cpIII or cpVUI (see column 31, lines 10-28, in particular). 
Barbas discloses that expression vectors expressing soluble fusion proteins in 
which the ligand binding region is fused to bacteria coat protein allows the 
expression of the multiple fusion proteins on the surface of phage particles IE 
approximately 2700 cpVIII heterodimer receptor molecules per phage particle 
(see column 39 line 64 through column 40, line 7, in particular). Barbas further 
discloses that a short length of amino acid sequence at the amino end of a protein 
(IE a protein tag) directs the protein to periplasmic space (see column 8, lines 49- 
55, in particular. One embodiment of the invention is disclosed to be a fusion 
protein comprising in sequence a leader sequence-peptide linker-V region amino 
acid residue-peptide linker-phage coat protein and that in one embodiment, the 
second linker can define a proteolytic cleavage site which allows the 
heterodimeric receptor to be cleaved from the bacteriophage coat protein to which 
it is attached (see column 14, lines 60-65). Thus Barbas discloses but does not 
exemplify a soluble fusion protein comprising a bacteriophage coat protein 
covalently linked to T cell receptor domains. 

Onda et al. disclose a soluble fusion protein comprising a bacteriophage 
coat protein covalently linked to a single-chain T cell receptor by a peptide linker 
sequence wherein the single TCR chain is the alpha chain and the bacteriophage 
coat protein is cpVIII (see abstract and Figure 1, in particular). Onda et al. also 
teach that TCR-bacteriophage coat protein fusion protein can be used to study 
specific binding interactions of the TCR chain to antigenic ligands (see paragraph 
bridging pages 1394-1395, in particular). 

Huse et al. teach that fusion proteins comprising a single chain fusion 
protein comprising Fab fragment of immunoglobulin (which comprises the 
antigen binding pocket of the immunoglobulin molecule) and bacteriophage VIII 
coat protein can be produced and display the fusion protein when expressed in a 
Ml 3 derived vector. Huse et al. further teach that bacteriophage VIII coat protein 



Examiner Schwadron withdrew this statement in the next Office Action dated June 17, 2002 (see below). The 
statement was originally made by a prior Examiner (Lubet) in a related § 1 03 rejection that has been withdrawn. In 
that earlier rejection, Lubet argued that Barbas and Onda do not teach a soluble fusion protein in which a single- 
chain TCR linked to a bacteriophage coat protein. See the Office Action dated June 23, 1998 at pg. 8, part B. In the 
§103 rejection on appeal, Examiner Schwadron argued that Barbas and Onda teach use of TCR -bacteriophage VIII 
fusion protein. That molecule, a heterodimeric T cell receptor fusion protein, is not the claimed invention. 
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fusion protein can recovered from culture medium or from the periplasmic space 
(see abstract). 

Therefore it would have been prima facie obvious to one with ordinary 
skill in the art at the time the invention was made to make a soluble TCR fusion 
protein comprising the Va.. -peptide linker- Vp -Cb fragment-protein taught by 
Chung et al. linked to a bacteriophage VIE coat protein because Barbas et al. 
and Onda et al. teach TCR-bacteriophage VIII coat fusion proteins can be 
used to study antigen binding properties of such a fusion protein and Huse et 
ah teach that fusion proteins comprising bacteriophage VIII coat protein can be 
produced in bacteria and recovered in relatively large quantities. 

One with skill in the art would be motivated to make such a fusion protein 
to study the antigen binding region of the TCR component or to use the protein to 
elicit anti-idiotypic antibodies. One with skill in the art would be motivated to 
make such a fusion protein in which the Va and Vp region was derived from 
human TCR in order to study human TCR properties or to elicit anti-idiotypic 
antibodies to the TCR component of the protein. 

The preceding grounds of rejection have been maintained since the August 25, 2000 
Office Action despite Appellants' rebuttal argument and claim amendments discussed below. 

C On October 25, 2000, Appellants' representative met with Examiner Schwadron at 
the USPTO and discussed the art cited. No agreement was reached. 



D. Appellants submitted a response to the rejection set forth in paragraph B, above, on 
February 22, 2001. In that response, Appellants rebutted the prima facie obviousness argument 
by pointing out: 1) that the cited references did not teach or suggest that the membrane "anchor" 
of Chung's single-chain TCR (GPI: a membrane protein) could be substituted with the 
bacteriophage coat protein of Barbas' TCR; 2) that there was no reasonable expectation that the 
substitution (switching Chung's anchor for Barbas' phage coat protein) could be achieved in view 
of substantial differences between scTCRs and TCRs; and 3) that the Examiner's citation of 
Onda was not correct ie., it does not disclose TCR-bacteriophage coat protein fusions, but 
instead, dwelt on smaller constructs having only a V-a chain (but no V-p chain). Onda 
characterized his constructs as having "unusual" binding properties that were not characteristic of 
TCRs. A subset of such constructs were reported by Onda not to work at all. 
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Appellants also discussed the Holler reference: a peer-reviewed scientific article from 
the U.S. Academy of Sciences (PNAS (USA) (2000) 97: 5387 at 5389). Holler provided 
independent and objective evidence of the long-felt need and failure of others in the field to 
make and use the claimed fusion molecules. Specifically, Holler stated that phage display had 
not yet proven successful in the engineering of scTCRs. 2 



E. In response to the Appellants' arguments in paragraph D above, the Examiner 
issued a Final Office Action dated June 17, 2002. 3 The Examiner maintained the prima facie 
rejection and stated: 



Regarding appellants comments, while heterodimeric molecules are a 
preferred embodiment disclosed in Barbas et al., Barbas et al. disclose: 
"In another embodiment, the present invention contemplates a polypeptide 
comprising an insert domain flanked by an amini-terminal secretion signal 
domain and a carboxy-terminal filamentous phage coat protein membrane anchor 
domain." (column 14, first complete paragraph). 

Barbas et al. further disclose than said construct could include a "receptor 
protein" (column 14, second paragraph), indicating that the disclosed method 
could be used for receptors per se (eg. single chain or heterodimeric or single 
chain heteromers). Single chain T cell receptors were known in the art (see Chung 
et al.). 

Regarding appellants comments about the single chain TCR taught by 
Chung et al, 

Chung et al. teach that the GPI anchor is cleaved and the soluble TCR still 
has all the antigen binding properties of the TCR (see pages 12656-12658). Thus, 
the GPI anchor is not required for the soluble TCR to function, it is just used in 
one particular method of making the soluble TCR. Regarding motivation to create 
the claimed invention, Chung et al. discloses that it would be desirable to 
produce their TCR in a phage display system (see page 12658, first column). 
In addition, Barbas et al. teach the advantages of their system for the production 



By "phage display" is meant the process of making a recombinant bacteriophage expressing the scTCR as part of 
the phage protein coat. After infecting bacteria with a recombinant phage engineered to produce the scTCR, the 
protein would be "displayed" on the bacterial cell surface as bacteriophage. The scTCR "displayed" in this manner 
would be amendable to engineering. 

The Examiner essentially repeated his rejection of the claims as set forth in the August 25, 2000 rejection. But see 
footnote 1 . With respect to Examiner Schwadron f s discussion about Holler, Appellants submitted Weidanz et al. as 
part of a Rule 132 Declaration to address an obviousness rejection that has since been withdrawn. The reference 
provided evidence that a particular scTCR-bacteriophage coat protein vector (pKC44) was capable of forming an 
antigen binding site when expressed. The reference is not prior art to the present application. 
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of peptides. Regarding reasonable expectation of success, both Barbas et al. and 
Chung et al. disclose use of phage display systems to produce single chain 
antibodies (see column 2, third paragraph from bottom and page 12658, first 
column). In addition, the soluble single chain TCR molecules functions with or 
without the GPI linker indicating that the construct itself is functional. 

Regarding appellants comments about Holler et al, said publication was 
published in May 2000. In the amendment filed 6/3/2000, applicant submitted a 
publication by Weidanz et al. (J. Imm. Methods 1998) which discloses the 
claimed invention. Thus, it appears that Holler et al. simply are not familiar 
with the prior art. Thus, the comments of Holler et al. carry no weight 
because two years prior to the Holler et al. publication, Weidanz et al. had 
already published data regarding the production of single chain TCR using 
bacteriophage. Furthermore, Holler et al. discloses a yeast system for producing 
a single chain TCR and it appears that the main focus of Holler et al. is to promote 
their system. 

Regarding appellants comments about Onda et al., the instant rejection 
indicates that "Onda et al. disclose a soluble fusion protein comprising a 
bacteriophage coat protein covalently linked to a single-chain T cell receptor 

by a peptide linker sequence wherein the single TCR chain is the alpha chain and 
the bacteriophage coat protein is cpVIII (see abstract and Figure 1, in particular)". 
The art recognizes that the alpha and beta chains of the TCR generally both are 
involved in antigen binding. The art also recognizes that soluble TCR which bind 
antigen would have a variety of uses. 

F. Appellants filed a Notice of Appeal on October 17, 2002. 

ARGUMENTS 

As an initial mater, Appellants wish to emphasize the substantial differences between 
TCR heterodimers ("TCRs") and single-claim T cell receptors ("sc-TCRs"). 

The TCR is a heterodimer with one a chain and one p chain. 4 See Exhibit A (i). Each of 
these chains passes from the exterior of the T cell, through the cell membrane, and into the cell 
interior (cytosol). The a and P chains each include a variable (V -a, V-P) region that cooperate to 
form an antigen binding pocket. The regions are "variable" because its chemical structure can be 



A textbook in the field describes the TCR as "a heterodimer composed of an a and a P polypeptide chain, both of 
which are glycosylated." See Alberts, B et al. (1989) in Molecular Biology of the Cell, 2 nd Ed. Garland Publishing, 
Inc. New York at pg. 1037. By convention, a "heterodimer" such as the TCR properly has two chains (dimer) both 
of which are different (hetero) from the other. 
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changed to make a pocket that fits another antigen. Each of the V-a and V-p regions are 
associated with a constant (C) region. 

TCR heterodimers have been extremely difficult to isolate from T cells. This problem 
has hindered study of the receptor. One approach to address the problem has been to make 
single-chain T cell receptors ("scTCRs"). These synthetic receptors include, on one chain 
instead of two, a fused V-a and V-{3 region. See Exhibit A (ii). It has been customary to space 
the V-a and V-p regions from each other with a flexible linker to allow the regions to make an 
antigen binding pocket. Unfortunately, many scTCRs have still proven to be difficult to make 
and use. 

Appellants point out that "TCR M is understood in the field to mean a heterodimeric T 
cell receptor. The TCR is a membrane bound (insoluble) receptor in which the a and p chains 
cooperate to bind antigen. Reference to a "scTCR" is understood to mean a synthetic single- 
chain molecule that includes the V-a and V-P regions bound together usually through a flexible 
linker. Unlike the TCR, the scTCR binds antigen with only one chain. The TCR and scTCR are 
structurally distinct proteins that are different molecules that bind antigen in different ways. 
Compare Exhibit A (i)-(ii) (showing eg., substantial structural differences between a TCR and 
scTCR). 

L Summary of the Cited Art 

A. Chung reports functional three-domain single-chain T cell receptors consisting of a 
human Va and Vp region that recognizes a particular antigen (HLA-DR2b/myelin basic protein). 
Chung determined that it was important to fuse a Cp region to the VP region. Such a three 
domain construct, when linked to a synthetic cell membrane anchor (glycosyl phosphatidyl- 
inositol (GPI) or CD3£ fragment), was found to be expressed and functional. Chung disclosed 
that the cell membrane anchor could be cleaved from the single-chain receptors to obtain soluble 
protein. See the Abstract. 

Chung opined that his single-chain design "may allow" construction of TCR phage 

libraries and that such libraries "may be" tools for studying TCRs. See pg. 12658. However, 

there is no specific disclosure in Chung about how such libraries could be made or, if made, 

whether his single-chain TCRs could tolerate fusion of the bacteriophage coat protein. Chung 
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does not report or suggest that the GPI or CD3£ membrane anchor could be substituted with a 
bacteriophage coat protein. Even if there was such a teaching, there is no disclosure in the 
reference about whether a recombinant bacteriophage could tolerate Chung's scTCR as part of 
the phage coat. 

B. Barbas discloses heterodimeric receptor libraries that use phagmids. In the Abstract, 
phage are taught to encapsulate a genome encoding first and second polypeptides of a receptor 
such as an antibody; in which the first and second polypeptides are integrated into the coat 
matrix of the phage. Barbas generally discloses that such phage may include a polypeptide with 
an "insert domain" that has a receptor domain flanked by a secretion signal domain and a phage 
coat protein membrane anchor domain. Col. 14, lines 10-14. Heterodimeric receptors are 
preferred. See Col. 3, lines 1-41; Col. 14, lines 15-29; and Col. 15, lines 28-32. According to 
Barbas, there was some uncertainty in the field about which portions of bacteriophage coat 
proteins were needed for phage assembly. Col. 2, lines 19-46. 

Barbas does not teach how to make or use a scTCR with or without a fused bacteriophage 
coat protein. 

C. Onda reports use of a phage display system to explore binding interactions between 
the V-a region and antigen. Onda did not disclose use of the system to study TCR or single- 
chain TCR interactions. In the Abstract, Onda provides at pg. 1387: 

We utilized an Ml 3 phage display system, designed for multivalent 
receptor display, to explore specific binding interactions between various TCR a 
chains and specific antigen in the absence of MHC. 

That is, Onda fused only the V-a region to bacteriophage coat protein. The constructs do not 
include a V-p region and are not scTCR fusion proteins. Onda's fusions are much smaller TCR "half- 
molecules" lacking the V-p region and antigen binding pocket of Appellants' scTCR. 

Onda at pg. 1395, col. 1, cautioned that his constructs were unusual and not typical of 

standard TCR interactions: 

Our results extend these findings by demonstrating that the dominant interactions 
of certain TCRct chains for peptide antigens may be sufficiently high that they can be 
analysed independently. However, these interactions are quite unusual in that they do 
not require the expression of the second TCR subunit or normal MHC and coreceptor 
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interactions. These results may raise concern that this model does not reflect typical 
TCR-ligand interactions. 

Significantly, only some of Onda's Va chain fusion proteins were reported to bind 

antigen when fused to bacteriophage coat protein. At pg. 1395, col. 2 he states that: 

...only a subset of TCR Va have capacity for direct interactions with antigen 
strong enough to be detectable in this system. 

Onda does not teach or suggest fusing a scTCR (V-a and V-|3) to a bacteriophage fusion 
protein. 

D. Huse described a phage vector system for screening and producing antibody F(ab) 

fragments. 5 Huse's system was reportedly used to produce free F(ab) and F(ab) displayed on the 

surface of bacteriophage. According to Huse however, not all attempts to produce F(ab) were 

successful. In more than a few instances, the recombinant bacteriophage made to produce the 

fusions apparently would not tolerate certain amounts of antibody protein. In describing 

attempts to display certain antibody H and L chains with his phage vector system, Huse stated on 

pg. 3919, col. 2 that: 

Phage titers of [phage vector] infected cultures were found to decrease 
relative to the level of F(ab)-pVIII fusion protein incorporation (cite omitted). 
Taken together, these results suggest that a functionally viable phage particle 
may be able to tolerate a limited number of incorporated F(ab)-pVIII fusion 
products and that the amount of F(ab) incorporated into the phage coat may 
adversely affect phage titers and overall F(ab) yield. 

Huse does not disclose fusing a bacteriophage coat protein to a scTCR or TCR. 

II. Summary of the Examiner's Argument 

Grounds for the present rejection under 35 U.S.C §103 were formulated in the Office 
Action dated August 25, 2000. See paragraph B, above. The basis for the rejection has not 
changed substantially in the face of Appellants' arguments and claim amendments. 

According to Appellants' understanding of the Examiner's alleged prima facie case, the 
primary references, Chung and Barbas, are alleged to teach a scTCR linked to a bacteriophage 



F(ab) is an abbreviation for an antigen binding fragment of an antibody ( fragment antigen binding). F(ab) is a 
heterodimer consisting of two different chains ie., the antibody light and heavy chain. F(ab) is readily made by 
cleaving whole antibodies with specific proteolytic enzymes. 
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fusion protein. Onda and Barbas are relied on to teach that TCR-bacteriophage fusion proteins 
can be used to study antigen binding. Huse is used to teach that fusion proteins with the coat 
protein can be made in bacteria. The foundation of the Examiner's position is that because 
Barbas, Onda, and Huse teach some bacteriophage coat protein fusions, then it would be obvious 
to make fusions with Chung's scTCRs. Although facially somewhat plausible, the rejection is 
flawed on both scientific and legal principles as discussed herein. 

IQ. The Examiner Erred in Rejecting Claims K 2, 4, 7, 8. 14. 67, 69, 71 and 72 as Being 
Obvious 

A. Requirements of the prima facie case and its maintenance. 

The Examiner erred in maintaining the obviousness rejection in the face of claim 

amendments and the state of the art as submitted made in this case and its parent. The Federal 

Circuit has reiterated that an Examiner's prima facie case is but a procedural tool of patent 

examination, with the express purpose of allocating the burdens of going forward as between the 

Examiner and Applicant. See In re Deckler 977 F.2d at 1449, citations omitted): 

Specifically, when obviousness is at issue, the examiner has the burden of persuasion 
and therefore the initial burden of production. Satisfying the burden of persuasion, 
constitutes a so-called prima facie showing. Once that burden is met, the applicant has 
the burden of production to demonstrate that the examiner's preliminary determination is 
not correct. The examiner, and if later involved, the Board, retain the ultimate burden of 
persuasion on this issue. 

Clearly, as demonstrated herein, adequate evidence of the unobviousness of the claimed 
invention was provided by Appellants to shift the burden of persuasion to the Examiner. 

In view thereof, it is requested that the Board review the obviousness question based on 
the invention as claimed, and the cited references, including all relevant parts thereof. 

B. Standard For Reviewing An Obviousness Rejection under 35 USC §103. 

The Federal Circuit has reiterated the manner in which obviousness rejections are to be 
reviewed. Where claimed subject matter has been rejected as obvious in view of a combination 
of prior art references, "a proper analysis under section 103 requires, inter alia, consideration of 
two factors: (1) whether the prior art would have suggested to those of ordinary skill in the art 
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that they should make the claimed composition or device, or carry out the claimed process; and 
(2) whether the prior art would also have revealed that in so making or carrying out, those of 
ordinary skill would have a reasonable expectation of success." In re Vaeck. 947 F.2d 488, 493, 
20 U.S.P.Q.2d 1438, 1442 (Fed. Cir. 1991), cited In re Dow Chemical Co .. 837 F.2d 469, 473, 5 
U.S.P.Q.2d 1529, 1531 (Fed. Cir. 1988). As the Federal Circuit emphasized by succinctly 
summarizing: "Both the suggestion and the reasonable expectation of success must be founded in 
the prior art, not the Applicants' disclosure." Id. See also In re Merck & Co., Inc.. 800 F.2d 1091, 
231 USPQ 375 (Fed. Cir. 1986). 

More recently, the Federal Circuit has reviewed the case law regarding 35 U.S.C. §103. 
See In re Sang-Su Lee 277 F.3d 1388, 61 U.S.P.Q.2d 1430 (Fed. Cir. 2002). 

Should the Board adopt the Examiner's prima facie case, Appellants submit that the 
claimed invention would not have been obvious in view of the legal standard summarized above. 

C. No Prima Facie Case of Obviousness 

As noted above, the foundation of the Examiner 1 argument rests on the belief that it 
would be obvious to fuse Chung's scTCR to the bacteriophage coat protein of Barbas because, 
allegedly, Barbas and Onda teach TCR-bacteriophage coat fusion proteins and Huse discloses 
that fusion protein with such a coat can be made in bacteria. 

For the Examiner's prima facie case to stand, it is imperative that he establish that: 1) 
The cited references disclose or suggest fusing a bacteriophage coat protein to Chung's scTCR; 
2) there is a settled role for the bacteriophage coat protein in making fusion proteins; and that 3) 
one could fuse Chung's scTCR to Barbas* coat protein with a reasonable expectation of success. 

The Examiner's position is not supported by any of these points. Barbas, as relied on, 
does not teach or suggest a scTCR or even fusion of a scTCR to a bacteriophage coat protein. 
Onda and Huse, when read in their entirety, exemplify uncertainty in the field about using the 
bacteriophage coat protein to make certain fusion proteins. Even Barbas admitted that there was 
some doubt about how much one could change certain bacteriophage coat proteins without 
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hindering phage assembly. Moreover, some of Onda's and Huse 1 molecules did not work well. 
Others did not work at all On top of that uncertainty is heaped additional doubt about whether 
Chung's "anchor" fragments could be substituted with the bacteriophage coat protein of Barbas. 
There was also doubt about whether the bacteriophage would tolerate fusion of the scTCR to its 
coat. 

i) Barbas and Onda does not teach or suggest a single-chain TCR (scTCR) 
In the Office Action dated August 25, 2000, Examiner Schwadron took the position that 
Barbas discloses: 

soluble fusion protein comprising a bacteriophage coat protein fragment covalently 
linked to a single-chain heterodimeric receptor (see the abstract and column 15, lines 
27-28, in particular). Barbas also discloses that the fusion protein may comprise 
domains of heterodimeric proteins derived from several ligand binding proteins, 

including immunoglobulins and T cell receptors (see column 17, lines 62-66 and column 
19, lines, 9-28. Barbas discloses that T cell receptor comprises alpha and beta chains each 
having a variable(V) and constant(C) region and T cell receptor has similarities in genetic 
organization and function to immunoglobulins (see column 19, lines 19-22, in particular). 

sjc s|c sfc «fc 

Thus Barbas discloses but does not exemplify a soluble fusion protein comprising a 
bacteriophage coat protein covalently linked to T cell receptor domains 

The heterodimeric receptor proteins pointed out by the Examiner are not scTCRs. 
Heterodimeric proteins, and particularly the TCR of Barbas, are understood in the field to consist 
of two different a and p chains. Brief at pg. 1 . Unlike the TCR, the scTCR of Appellants' 
claimed invention is a single-chain molecule with a V-a chain fused to a V-p chain. The position 
that Barbas discloses a "single-chain heterodimeric" receptor simply makes no sense. How can a 
single-chain molecule be a "heterodimer" when that requires two (dimer) different (hetero) 
chains? Brief at pg. 1 and footnote 1 . Barbas could not have had the single-chain constructs of 
Onda and Chung in mind. Those references were published well after the priority date of the 
Barbas patent. Thus, nowhere in the reference is there any disclosure about how to make or use 
a scTCR. 

Faced with this rebuttal, the Examiner took the position in the Final Office that Barbas 1 
disclosure of "polypeptides comprising an insert domain" and "receptor proteins" should be read 
to include Chung's scTCRs. Also included in that sweeping reading of Barbas are "single chain 
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or heterodimeric or single chain heteromers". That position is without merit. Too much is read 
from Barbas. It does not provide for any scTCR molecules. If the Examiner's overly-broad view 
of the patent is allowed to stand and sweep in scTCRs, even though Chung's were published well 
after Barbas' priority date, it will preempt any attempt to obtain patent protection for scTCR- 
bacteriophage coat fusion proteins. A principle focus of Barbas was to provide heterodimeric 
receptors linked to a phage coat protein. See the Title of the patent, the Abstract and col. 3, lines 
1-41, for instance. Such receptors are not the fusion proteins Appellants claim and there is no 
suggestion in Barbas to make or use them. 

Even assuming, arguendo, that the Examiner is correct and that Barbas taught or 
suggested a scTCR (years before Chung or Onda were published), one reading Barbas in that 
way would be confused in light of the accepted understanding in the field that a heterodimer such 
as the TCR is a complex of two different polypeptide chains. Brief at pp. 1-2 and footnote 1 . 

As captioned above, Onda does not disclose a TCR or scTCR fusion to bacteriophage 
coat protein as alleged by the Examiner in the August 25, 2000 and June 17, 2002 Office 
Actions. Instead, Onda reports fusion of TCR a chains to bacteriophage coat protein. See 
Exhibit A (ii) showing the TCR a chain as part of a larger scTCR. These constructs are 
significantly smaller (and less likely to cause solubility problems when fused to coat proteins) 
than the scTCR fusions Appellants successfully made. 

Moreover, the Examiner ignored Onda's clear hesitation about reading too much from 
TCR a chain constructs that include a fused bacteriophage coat protein. According to Onda, the 
interactions of the constructs were unusual and not typical of TCR-ligand interactions. See 
above and Onda at pg. 1395, col. 1. 

Importantly, only some of Onda's TCR a chain constructs even worked to bind antigen. 
See above and pg. 1395 of Onda at col. 2, second full paragraph. 

According to Onda then, some TCR a chain-bacteriophage coat protein fusions work and 
some do not. Those that do work were viewed as "unusual" and "not typical". In view of this 
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caution, one working in this field would not be encouraged to fuse a bacteriophage coat protein 
to a scTCR. None of the other cited references shed any light on Onda's clear hesitation to 
extend there findings to other TCR molecules. 

The Examiner thus erred in trying to formulate a prima facie case by not giving due 

weight to all relevant portions of Onda. Contrary to this practice, it is well established that the 

Examiner must consider all relevant portions of cited references, including those portions which 

substantially weaken her position. In particular, the former CCPA stated in In re Mercier 515 

R2dll61, 185 USPQat778: 

The relevant portions of a reference include not only those teachings which would 
suggest particular aspects of an invention to one having ordinary skill in the art, 
but also those teachings which would lead such a person away from the claimed 
invention. 

See also Phillips Petroleum Co. v. U.S. Steel Corp ., 673 F.Supp. 1278, 1315, 6 USPQ2d 
1065, 1093 (D.Del. 1987), affd, 865 F.2d 1247, 9 USPQ2d 1461 (Fed. Cir. 1989). 

The Board is thus urged to take Onda in its entirety and to consider all relevant portions 
of it including the passages quoted above. Read in this way, as it should, the reference would 
lead one in this field to doubt whether it would be feasible to fuse a bacteriophage coat protein to 
a scTCR to produce a soluble and functional fusion protein. 

ii) Huse reported difficulties producing some bacteriophage coat protein fusions 

The Huse reference, as quoted above, reported that not all F(ab)-pVIII (bacteriophage) 
coat proteins could be made at high titre. That is, Huse stated that the bacteriophage may not 
tolerate some amounts of F(ab) constructs, thereby decreasing phage titres and overall F(ab) 
yield. See above and Huse at pg. 3919, col. 2. When Huse is read in its entirety, as it should, 
the Examiner's statement that "Huse et al. teach that fusion proteins comprising bacteriophage 
VIII coat protein can be produced in bacteria" is an unsupported generalization. Huse clearly 
found that some amounts of heterodimeric F(ab) constructs harmed the bacteriophage that 
carried them. In view of this warning, a worker in the field would have good reason to doubt 
whether a bacteriophage could be fused to a scTCR or even a heterodimer such as a TCR without 
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considerable experimentation. 

The Examiner took the position in the Final Office Action that Barbas and Chung provide 

a reasonable expectation that one could make the claimed fusion proteins: 

Regarding reasonable expectation of success, both Barbas et al. and Chung et 
al. disclose use of phage display systems to produce single chain antibodies (see 
column 2, third paragraph from bottom and page 12658, first column). 

However as clearly illustrated by Huse, not all phage display systems using 
antibodies work as expected. Some amounts of heterodimeric F(ab) antibodies cause 
problems. Thus the Examiner's position is not supported by the art of record in this case. 

The Board is thus requested to take Huse in its entirety and to consider all relevant 
portions of it including the passage quoted above. InreMercier 515 F.2d 1 161, 185 USPQ at 
778; and Phillips Petroleum Co. v. U.S. Steel Corp .. 673 F.Supp. 1278, 1315, 6 USPQ2d 1065, 
1093 (D.Del. 1987), affd, 865 F.2d 1247, 9 USPQ2d 1461 (Fed. Cir. 1989). 

The substantial uncertainties raised by Onda and Huse have not been addressed by the 
Examiner. No objective scientific work has been made of record to resolve or explain them. 
Read in their entirety, as they should, Onda and Huse point out problems about making and using 
some bacteriophage coat protein fusions. Even if one skilled in this field were to read Onda and 
Huse selectively and disregard their warnings, there is still nothing in the art relied on to suggest 
that one could make or use a scTCR fusion to a bacteriophage coat protein. Even Barbas 
admitted that there was uncertainty about what coat protein parts could be manipulated for phage 
assembly. 

In marked contrast, Appellants have demonstrated that it is possible to fuse a 
bacteriophage coat protein to the scTCR and obtain fully soluble and functional fusion protein. 
See Appellants' patent specification at Example 1 (showing construction of soluble scTCR fusion 
proteins); Examples 2-3 (production of special vectors to make the scTCR fusion proteins); 
Example s 4-5 (expression of Appellants 1 soluble scTCR fusions); Example 6 (purification of 
the soluble scTCR fusion proteins); Examples 7-1 1 and 16 (characterization of particular scTCR 
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fusion proteins); and Example 15 (analysis of a bacteriophage library expressing Appellants 1 
scTCR). 

Accordingly, the §103 rejection fails both prongs of the Federal Circuit test for 
determining obviousness. See In re Vaeck, supra; and In re Dow Chemical Co ., supra. 
It is submitted that the Board reverse the obviousness rejection in light of this test. 

Whether or not the Examiner is taking the position that it would be obvious to try 
to make the claimed scTCR fusion proteins, both the Board and Federal Circuit have 
made it quite clear that this is not a burden that Appellants must bear. In particular, the 
Court in In re OTarrelL 7 USPQ 2d 1673 (1988) held at page 1681: 

The admonition that "obvious to try" is not the standard under §103 has been 
directed mainly at two kinds of error. In some cases, what would have been 
"obvious to try" would have been to vary all parameters or try each of numerous 
possible choices until one possibly arrived at a successful result, where the prior 
art gave either no indication of which parameters were critical or no direction as 
to which of many possible choices is likely to be successful. 

See also Ex parte Old , 229 USPQ 196, 200 (1985). 

While the Court in* In re OTarrell went on to state that while obviousness does not 
require absolute predictability of success, what is required under §103 is a reasonable 
expectation of success. 

Thus whether those in the field may have been tempted to fuse a bacteriophage 
coat protein to a scTCR, the field would have been cautioned from doing so in view of 
the warnings of Onda, Huse and to some extent even Barbas. The Barbas patent does not 
disclose or suggest any scTCR fusion to the coat protein. Chung does nothing to remedy 
these defects. In short, the field's unsuccessful experience with some amounts of 
antibody heterodimers (Huse) and some single-chain constructs (Onda), provides at worst 
no basis for believing that fusion of a bacteriophage coat protein to an scTCR will work 
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and at best, a reason to doubt that such a fusion will result in a fully soluble and 
functional protein. 

iii) No teaching or suggestion that Chung's "anchor" fragment could be 
substituted with Barbas 9 bacteriophage coat protein 

Chung reported TCRs linked to a cell membrane anchor (glycosysl phosphatidylinositol 
(GPI) or murine CD3 £ chain). The anchor apparently helps to express the single-chain TCRs. 
The anchor molecules are entirely different from the coat proteins of Barbas both in terms of 
chemical structure and function. For example, Chung's anchors are hydrophobic cell membrane 
proteins while those of Barbas are relatively more hydrophilic bacteriophage coat components. 
Chung's anchor apparently plants the scTCR in the membrane while the coat envelops the phage. 

The Examiner has pointed to no teaching or suggestion in the cited art that Chung's 
anchor molecules could be substituted with Barbas 1 bacteriophage coat proteins. The 
obviousness rejection falls far short of establishing any nexus between Chung's anchors, which 
are attached to his scTCRs, and the coat proteins reported by Barbas. 

iv) Objective Evidence of Non-obviousness 

In addition to the lack of a prima facie case of obviousness, the strong objective 
evidence of non-obviousness presented during prosecution of this case further compels 
allowance of the claims. 

Evidence of such objective indicia of non-obviousness, the so-called "secondary 
considerations" must be considered in all obviousness determinations. Stratoflex, Inc. v. 
Aeroquip Corp. . 713 F.2d 1530, 1538-1539 (1983): 

Indeed, evidence of secondary consideration may often be the most probative and 
cogent evidence in the record. It may often establish that an invention appearing 
to have been obvious in light of the prior art was not. It is to be considered as part 
of all the evidence, not just when the decision-maker remains in doubt after 
reviewing the art. 
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See also Graham v. John Deere. 383 U.S. 1, 148 USPQ 459 (1966). 

This standard set forth by the Federal Circuit applies not only during litigation of 
issued patents, but to a determination of patentability during ex parte prosecution as well. 
In re Sernaker. 702 F.2d. 989 217 USPQ 1, 7 (Fed. Cir. 1983). However, in the instant 
case, the Examiner is not properly considered evidence of "long felt need and failure of 
others" in maintaining the present § 103 rejection. 

Specifically, Appellants 1 provided the Holler reference as indicating that the field 
longed to make the claimed fusion proteins but could not. Holler reported that phage 
display had not yet proven successful in making scTCRs despite what he saw as 
extensive structural similarity between antibodies and TCR V regions. Appellants 1 
invention addressed this need and succeeded by providing soluble fusion molecules with 
a bacteriophage coat protein linked to the scTCR. The Holler reference is highly 
probative of the difficulties the field had in making these molecules and should be given 
substantial weight by Examiner Schwadron. MPEP 716.01(b). 

The Examiner completely dismissed the Holler reference on grounds that "Holler 
et al. simply not familiar with the prior art". See Part D, above. That is no basis for 
disregarding the Holler's statement that the field wanted but failed to produce the claimed 
invention. Appellants are under no burden to provide evidence of Holler's knowledge of 
the art in order to have the reference considered as objective indicia of non-obviousness. 
See Stratoflex, Inc. v. Aeroquip Corp. . 713 F.2d 1530, 1538-1539 (1983); InreSemaker, 
702 F.2d. 989 217 USPQ 1, 7 (Fed. Cir. 1983); and MPEP 716.01(b). 

Moreover, Examiner Schwadron's citation of Weidanz et al. to support his 
disregard of Holler is clearly improper. That reference is not prior art and cannot serve as 
a basis for ignoring Holler or substantiating the obviousness rejection on appeal. 

It is requested that the Board consider Holler as objective evidence that workers in the 
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field wanted, but could not make, the claimed invention. 

CONCLUSIONS 

For the Examiner's prima facie case to stand, he has the burden of showing that: 
1) The cited references disclose or suggest fusing a bacteriophage coat protein to a 
scTCR; 2) there is a settled role for the bacteriophage coat protein in making fusion 
proteins; and that 3) one could fuse Chung's scTCR to Barbas' coat protein with a 
reasonable expectation of success. These points have not been made by the Examiner. 
As discussed above, Barbas does not disclose scTCRs. Moreover, there was significant 
uncertainty in the field about whether it was possible to fuse a bacteriophage coat protein 
to a scTCR as exemplified by Huse and Onda. In view of the cited art and in 
consideration of the Examiner's position, it could be argued that one might be motivated 
to test fusing the bacteriophage coat protein to Chung's scTCR in the hope of producing a 
soluble and function protein. But this is not the legal standard required by our case law. 
It is without a doubt not obvious from the art of record to make the claimed invention of a 
scTCR fused to a bacteriophage coat protein. 

Appellants submit that they have overcome the Examiner's obviousness rejection 
in the view of all the facts and argument of record in this case. Simply put, one of skill in 
this area would not be able to predict, with any reasonable expectation of success, how to 
make and use the claimed invention. 

Importantly, Appellants have provided experimental evidence clearly showing that it is 
indeed possible to make and use scTCR-bacteriophage coat protein fusions. See Examples 1-11, 
15 and 16 as discussed above. 

In summary, Appellants submit that the instant invention is both novel and unobvious. 
The arguments set forth above establish that non-obviousness. 

The prior Appeal Brief submission was mailed on February 19, 2003 which was 
the next business day after two day Federal holiday (Feb. 17 was Presidents' Day and the 
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PTO was closed on Feb. 18 due to a weather emergency). Accordingly, submission of 
the prior Brief was timely and within the two-month extension period. The present 
submission of a revised Brief is also believed to be timely. 

Although it is not believed that the present submission requires any fee for 
consideration by the Office, the Examiner is authorizied to charge such fee to our deposit 
account 04-1105 should such fee be deemed necessary. 

Respectfully submitted, 
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EXHIBIT A 
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EXHIBIT B 
Claims 1, 2, 4, 7, 8, 14, 67, 69, 71 and 72 
on appeal 

What is claimed is: 

1 . A soluble fusion protein comprising a bacteriophage coat protein covalently linked to 
a single-chain T cell receptor comprising an antigen binding pocket, wherein the single-chain T 
cell receptor comprises a V-a region covalently linked to a V-p region by a peptide linker 
sequence that effectively positions the V-a region and the V-p region to form the antigen 
binding pocket, the soluble fusion protein further comprising a C-p region fragment. 

2. The soluble fusion protein of claim 1, wherein the C-terminus of the V-a region is 
covalently linked by the peptide linker sequence to the N-terminus of V-0 region. 

4. The soluble fusion protein of claim 2 wherein the C-P region fragment is covalently 
linked between the C-terminus of the V-P region and the N-terminus of the bacteriophage coat 
protein. 

7. The soluble fusion protein of claim 2, wherein the peptide linker sequence 
contains from approximately 2 to 20 amino acids. 

8. The soluble fusion protein of claim 1 ? wherein the bacteriophage coat protein is 
gene III or gene VIII protein. 

14. A soluble fusion protein comprising covalently linked in sequence: 1) a V-a 
region, 2) a peptide linker sequence, 3) a V-P region covalently linked to a C-p region fragment, 
and 4) a bacteriophage gene VIII protein, wherein the peptide linker sequence effectively 
positions the V-a region and the V-p region to form an antigen binding pocket. 

67. The soluble fusion protein of claim 1, wherein the C-terminus of the V-p region 
is covalently linked to the N-terminus of a C-P region fragment. 
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69. The soluble fusion protein of claim 1 , wherein the V-ct region and the V-p region 
are about 200 to 400 amino acids in length. 

71 . The soluble fusion protein of claim 1 , wherein the C-P region fragment is about 
50 to 126 amino acids in length. 

72. The soluble fusion protein of claim 70, wherein the C-p region fragment does not 
include a cysteine residue corresponding to position 127 of a full-length C-p region. 
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We describe the application of a novel filamentous 
phage vector system suitable for efficient screening 
and production of F(ab) antibody fragments. The 
vector system can concurrently produce free F(ab) 
fragments and F(ab) displayed op the surface of Ml 3 
bacteriophage via a V„C H l-pVIII fusion protein. 
When expressed in a supO (nonsuppressor) strain of 
Escherichia coli free F(ab) can be produced. Anti- 
body F(ab) fragments are secreted into culture me- 
dium at concentrations up to 0.3 mg/liter and con- 
veniently subjected to detailed analysis with little 
or no purification. Higher concentrations of F(ab) 
approximately 10 mg/liter) were found to accumu- 
late in the periplasmic space. In this report the 
vector system is shown to produce correctly folded 
uid assembled F(ab) fragments of chimeric L6, a 
cnAb against a tumor-associated Ag expressed by 
many human carcinomas. 

Until recently mAb have been primarily produced in 
Tiammallan cells. The slow growth rates and difficulty 
n genetically manipulating antibody genes expressed in 
mammalian cells have motivated development of meth- 
ods to express antibody genes In simpler organisms. Mo- 
ecular cloning techniques in bacteria have facilitated the 
production and manipulation of antibody fragments, in- 
creasingly aiding the search for useful antibodies (1. 2). 
Methods have been reported for the expression of anti- 
body fragments in Escherichia coli using plasm ids (3- 
7). bacteriophage X (8-11). and more recently the fila- 
mentous phage Ml 3 (12-17). Very large combinatorial 
lbrartes of 10 5 to 10" distinct antibody specificities can 
^e created in microorganisms, far greater than can be 
ichieved with hybiidoma cell fusion methods (9, 10). 
lowever, the rapid identification, isolation, and, if nec- 
jssary, modification of antibodies with the goal of im- 
proving affinity or redirecting specificity for Ag depends 
leavtly on the availability of powerful screening methods 
x>th In terms of sampling large numbers of antibody 
ragments and evaluating certain aspects of antibody 
binding. Hence, an antibody expression vector that per- 
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mits rapid cloning and mutagenesis of antibody V region 
genes and produces sufficient levels of antibody for bio- 
chemical analysis would be highly desirable. 

Because antibody fragments can be displayed on the 
surface of filamentous phage, bacteriophage Mid vectors 
are proving particularly valuable in creating and screen- 
ing sequence libraries for antibody fragments of Interest. 
Briefly, fusion proteins are created by Inserting DNA 
encoding an antibody fragment in front of a phage coat 
protein gene (18, 19). The fusion proteins become an- 
chored in the phage coat via the coat protein and antibody 
sequence is displayed at the phage surface. Phage-bear- 
ing antibody sequences of Interest can be detected by Ag 
binding and isolated in infectious form (13, 14). In addi- 
tion to antibodies the Incorporation of malarial protein 
(20), growth hormone (21), and a hexapeptide library (22) 
into the coat proteins of filamentous phage has been 
reported. 

We have developed an Ml 3 filamentous phage vector 
system that can produce and display F(ab) as a fusion 
product to pVm coat protein and can also synthesize free 
F(ab) in quantity. We report here the production of chi- 
meric L6 antibody F(ab) fragments in these M13-derlved 
vectors. L6 Is a mAb against a tumor-associated cell 
surface Ag expressed by many human carcinomas (23). 
L6 has been shown capable of lysing cancer cells in vitro 
(24) and is currently the subject of clinical trials (25). 

MATERIALS AND METHODS 

Construction of bacteriophage M13IXL604 for expression of LS 
Ftabh Restriction enzymes, calf Intestine alkaline phosphatase, T4 
polynucleotide kinase, T4 DNA polymerase, and T4 DNA ligase were 
purchased from Boehrlnger-Mannhelm (Indianapolis. IN). Total RNA 
was Isolated from the chimeric L6- secreting cell Une described by 
Fell et al. (26) by the guanldlntum thtocyanate-phenol/chtoroforrn 
method (27). First strand cDNA was synthesized using ollgo dT and 
BRL Superscript reverse transcriptase (GIBCO BRL. Grand Island. 
NY) and PCR* amplification of L6 H chain (V„-Cm1 ) and L chain (V L - 
Cm) sequences was performed by the method of Saiki et al. (28) as 
modified by Sastry ct al. (29). The following primers were used for 
PCR amplification. The restriction endonuclease recognizing the 
boldface and underlined cloning site In the sequences is Indicated 
within the parentheses: forward V„ primer (Xftol): S'-CACTCTGGA- 
CCTG AGCTCGAQAAGCCTGG AG AG-3 / : forward V u primer {NcoT) 

* Abbreviations used In this paper: PCR. polymerase chain reaction: 
gvm. gene vm coding for the major coat protein of M13 phage; pVm. 
major coat protein of M13 phage; ChI H chain C region 1.: V u L chain V 
region: IPTG. lsopropy 1-0-o-thioga lactopyranoaide; V M . H chain V region: 
MOPS. 3^N-inorpboUno)propanc sulfonic add: TES. 30 mM Tris-HCL 2 
mM EDTA. 2% sucrose (w/v). pH 8.0: sup£. strains of £ coli that carry a 
ghitamlne Inserting amber (UAG) suppressor tRNA: MES, 20 mM 3-(N- 
morphollno)pn>pane sulfonic add. 2 mM EDTA. 20% sucrose (w/v), pH 
7.5; supO. strains of £. coli that do not carry a suppressor tRNA. 
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S ' -OCCCAACCAGCCATOQCCCAAATTQTTCTCTCCCAGTCT-3* ; 
reverse CI pr^cT^Spe^-TCT^^ 

GGGCTCTGC-3': reverse Gt primer (Xbol): 5'-CCGCTrAACTCTj 
AGACT AACACTCTCCCCTQTTCAAGCT-3' . The V„-Ch1 and V l -0 
cTuHiia were digested and Ilgated into prepared Ml 31X31 and 
M13DC12 vectors (30) resulting In M 1 3DC3 1 L3-H and M13DC1 2L6-L. 
respectively (Fig. 1). Sequence correcUona of the cloned L6 V„-Q,l 
and V L -C* chains were accomplished by site-directed mutagenesis 
of uracil-substltuted ssDNA as described (31. 32). The correcting 
nucleotides are Indicated by the underlined sequences. Five N-ter- 
mlnai V M amino adds were lnadvertenUy omitted from the original 
L6 sequence Information resulting In an 1 1 -amino acid deletion upon 
cloning. One V„ correction primer was 5 ' ^TCTCCAGGci ICTTC- 
AGCTCAGCTXX?AGA GGCTTTTG(C/T)CAC-3'. This primer cor- 
rected the XhcA cloning site back to the orlglnaJ L6 sequence and 
replaced V„ codons 7 to 10. The C/T mixed site introduced an amino 
acid change In the leader sequence that was found to Increase 
expression. The following V„ primer served to restor e the remaining 
six N-termlnal amino adds 5 ' <!AGCTCAGGTtXAGACTGCACCAAC- 
TGGATCTCOGCCATOGCTGOTTGGGC-3'. The following V L primer 
served to "correct an Incorrect nucleoti de In the reverse C* PCR 
primer: 3 ' -ACTCTCCCCTGTTGAACCTCTTTGTGA-3' . The H chain 
encoding M 1 31X3 1 L6-H and L chain encoding M13IX1 2L6-L vectors 
were combined by annealing as described (30) to form M131XL604 
(Fig. 1). M131XL605 was derived from M131XL604 by mutation of 
the TAG (stop) codon. located between the H chain encoding se- 
quence and the pseudo wild-type gene VIII. to GGT (glycine) using 
the oligonucleotide 5 ' -CGCCTTC AGC ACCGC ATCC ACT AGT-3 ' so 
that continual V„-C«l-pVIII fusion protein would be made. DNA 
sequence analysis of ssDNA prepared from phage Isolates was per- 
formed with Sequenase Version 2 according to the manufacturer 
(United States Biochemical. Cleveland. OH). 

Antibodies and reagents. The anti-Id mAb to L6 have been 
previously described (33). Anti-Id I la a T 2b Isotype. antl-ld 3 la a 
72a Isotype. and both antl-ld 7 and antl-ld 13 are 7 1 Isotypes. 
Alkaline phosphatase -conjugated antibodies were purchased from 
Fisher Biotech (San Francisco. CA). Unconjugated goat anti-human 
* antibody was purchased from Caltag Laboratories (So. San Fran- 
cisco. CA). Rabbit antl-M13 IgG was purified by Sepharose-proteln 
A chromatography and exhaustively absorbed against whole E. colt 
The anti-M13 antibody was subsequently btoilnylated using D-b4o- 
tlnoyl-c-amlnocaproic acid N-hydrcorysucclnlmlde ester (Boehrlnger- 
Mannhelm) using standard chemistries. Vectastaln avldln-horse- 
radlsh peroxidase complex (Vector Laboratories, BurUngame. CA) 
and streptavldln-alkaltae phosphatase complex (Boehringer-Mann- 
heim) were used for second step reactions. 

Screening by replicate JUter lifts. M 1 31XL604 phage were plated 
at a low plaque density. F(ab) expression was Induced by overlaying 
the plate with a 0.45-m nitrocellulose filter (Schleicher and Schuell, 
Keenc. NH) soaked In 10 mM I PTC and Incubating at room temper- 
ature from 6 h to overnight. The filter was removed and placed In 
blocking buffer (Bioslte Diagnostics. San Diego. CA) to block nonspe- 
cific binding sites. Phage growth was resumed by Incubating the 
plate for an additional 2 h at 37*C and a second filter appUed as 
described above. This procedure was repeated for the last filter and 
all filters were then blocked In blocking buffer. All monoclonal or 
polyclonal antibodies to be used for screening were diluted In block- 
ing buffer. Filters were probed with either alkaline phosphatase- 
conjugated goat antibody to human X -chain, alkaline phosphatase- 
conjugated goat antibody to human *-chaln. or antl-ld 3, which binds 
to L6 antibody, in the case of antl-W 3. a secondary alkaline phos- 
phatase-conjugated goat antibody to mouse IgT^a was used for de- 
tection. All filters were then washed three times for IO mln with 25 
mM Tris, 0.137 M NaCl. 5 mM KCI, 0.9 mM CaClj. 0.5 mM MgCli. 
and 0.05% Tween 20 (pH 7.4) and developed with alkaline phospha- 
tase substrate reagent (Blo-Rad. Richmond, CA). 

Ffab) production and purification. For analytic scale production 
of F(ab) the supE amber suppressor strain XL-1 (Stratagene. San 
Diego. CA) and the supO nonsuppressor strain MK30-3 (Boehrlnger- 
Mannheim) were each grown In 2X YT medium at 37*C until the 
cultures reached a density of 0.4 to 0.6 at OD» Each strain was 
then diluted 1/10 Into three culture tubes containing 3 ml 2X YT 
and Infected with 3 *d of high titer (I0 il plaque-forming units/ml) 
phage stock of M13DCL604. M13DCL605. or M131X31/tube and In- 
cubated with shaking for 3 h at 37*C. Protein synthesis was Induced 
by the addition of IPTG to a final concentration of 1 mM and shaking 
allowed to proceed for 10 to 14 h at ambient temperature. Perl- 
plasmlc fractions were prepared essentially as described by Skerra 
and Pluckthorn (5). The Infected cultures were centrtfuged for 10 
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figure 1. Construction of M 1 31XL604 vector for expression of L6 Fab. 
The M131XL604 vector product contains one copy of pseudo wild-type 
gVlfl (gVWH and one wild-type gvm (gvm) both downstream from the H 
chain encoding region. gVHP* has been altered In sequence to reduce 
homologous recombination with gvm contained on the same vector. The 
presence of gVUI reduces the likelihood of selection against certain fusion 
proteins as the result of co m promised phage viability (34). 

mln at 4°C and the cleared culture supernatant was reserved. The 
bacterial pellet was resuspended In 60 ml cold TES and digested on 
ice for 1 0 mln af ter adding an equal volume of a cold freshly prepared 
solution containing 2 mg/ml lysozyme in TES. The perlplaamlc space 
containing free F(ab) was fractionated by centrtf ligation at 9000 rpm 
for 10 mln at 4*C The soluble periplasmlc fraction was retained and 
diluted in blocking buffer to adjust the concentration of F(ab) to be 
equivalent to that found In the cleared culture supernatant 

For purification of L6 F(ab) a 1 -liter culture of MK30-3 was grown 
and Infected with M131XLG04 as described above. The cells were 
harvested by centrtfugation at 5800 x g for 10 mln at 4"C. The pellet 
was resuspended In 20 ml MES at ambient temperature. An equal 
volume of 2 mg/ml lysoryme In MES was added wtth mild vortexing 
and the suspension Incubated at ambient temperature for 10 mln. 
The soluble perlplasmic fraction was isolated by centrtfugation at 
0700 x g for 10 mln at 4*C The perlplasmic fraction was subjected 
to a second centrtfugation at 12.000 x g f or 30 mln at 4«C. The 
cleared periplasmlc fraction was loaded onto a macroprep 50 S 
support (Blo-Rad) washed extenstvely with 20 mM MOPS. pH 7.5 
and ehited with 20 mM MOPS. 120 mM Nad. pH 7.5. The partially 
purified LB F(ab) was concentrated by centrtfugation In a Centrlcon 
30 device (Amicon. Beverly. MA) and size fractionated on a Biogel P- 
60 (Blo-Rad) column equilibrated and eluted with 20 mM MOPS. 120 
mM NaCl. pH 7.5. Eluted L6 F(ab) fractions were assayed for binding 
to antl-ld 3 by EUSA. Fractions elutlng at 8. 9. and 10 ml were then 
pooled and concentrated. Samples from each purification step were 
analyzed by SDS-PAGE on a 1096 nonredudng gel followed by stain- 
ing with Coomassie brilliant blue. 



SCREENING AND PRODUCTION OF F\ab) IN A PHAGE VECTOR SYSTEM 



EUSA characterisation off US F\ab) produced by M13IXL604. 
J13D0604 phage stocks wtre prepared for assay by overnight 
ifcctton of XL-1 In L8 broth and 10 fig/ml tetracycline to maintain 
xpreaaion of F\ and 1 mM IPTG. The bacteria were removed by 
en tnf ugatlon (10 mln in a mlcrofuge) and the phage solution was 
saayed for binding to L6-spec1f Ic anU-ld 1,3.7. 13, and to an antl- 
-chain antibody by EUSA. All antibodies were coated onto Immukm 
! nUcrotlter plates (Dynatech Laboratories. Chantllly, VA) at lO^g/ 
il In 0. 1 M NaHCO-., pH 8.5, overnight at 4°C. The antibody solution 
/as shaken out and the plates were blocked with 300 M I of specimen 
•locking buffer (Genetic Systems. Seattle, WA) Tor 1 h at room 
cmperature. Plates were washed before use with 0.5% Tween 20 In 
.15 M NaCl and with the same solution between incubations. Phage 
llutlons were dispensed Into microliter plates In 100 pi of blocking 
uffer and Incubated overnight. After washing. 100 >J of conjugate 
locking buffer (Genetic Systems) containing 1 *g/ml of blotlnylated 
abblt antibodies to Ml 3 were Incubated 1 h at am Went temperature, 
ifter washing. 100 ?\ of Vectastain avidln-HRP complex were In- 
u bated 30 mln at room temperature.' After a final wash step, 100 
1 of chromogenlc substrate (3.3'.3.5'-tetramethylbenzldlneJ In a 
ItraU/phosphate buffer were added. The reaction was stopped with 
00 pi of 3 N mS0 4 at various times ranging from 10 min to 2 h to 
chleve an optimum signal to background ratio for each different 
late-coating antibody. Plates were read on a mlcroplate reader 
3lotck. Burlington. VT) In dual channel mode at 450/63O nm. All 
ssay points were measured In duplicate. Ml 31X31 served as a 
.egaUve control. 

The expression of M13IXL604. M13IXL605, and M131X31 In 
acterial strains MK30-3 and XL-1 was also analyzed by ELISA. 
:ulture supernatant and diluted peiiplasmlc fractions prepared from 
ifected XL-1 and MK30-3 bacteria described above were serially 
lluted In diluent (Bloslte Diagnostics), added to mic route r plates 
oated with anti-Id 3 and Incubated for 2 h at ambient temperature. 

6 F(ab} expressed In the culture medium or In the periplasmlc space 
ras detected with an alkaline phosphatase-conjugated goat antibody 

7 human « -chains. Expression of L6 F(ab) displayed on the phage 
urface was detected by incubating the captured sample first with 
he blotlnylated rabbit antibody to M13 and then with streptavldln- 
Ikallne phosphatase complex. The washed plates were developed 
rtth 6 mg/ml phenolphthalein monophosphate in 0.1 M aminome- 
lylpropancdlol, 0.5 M Tria. and 6. 1 % NaN„ pH 1 0.2 (JBL Scientific, 
an Luis Obispo. CA) for 10 to 30 mln. The reaction was stopped by 
dditlon of one-third volume of cold 30 mM Tris base. 6 mM EDTA. 
nd the absorbance at 560 nm measured. 

RESULTS 

Construction of L6 F(ab) expression vector 
H31XL604. The construction of M13IXL604 Is shown 
n Figure 1. Total RNA was isolated from the chimeric 
■6-secretlng cell line and cDNA synthesized. After PCR 
mpllflcatlon of the cDNA using sequence- specific 
rimers, the chimeric L6 V t -Cr L chain was digested to 
ompletion with JVcol and Xbal and cloned Into the Ncol/ 
Coal site of Ml 31X1 2 to construct M13IX12L6-L. Slmi- 
u-ly. the chimeric L6 V h -Ch1 H chain PCR product was 
lgested to completion with Xhol and Spel and cloned 
ito the Xhol/ Spel site of M13EX31 to construct 
113IX31L6-H. Ml 31X31 contains an amber stop codon 
>cated directly 5' of a modified gVlil gene (pseudo-gVUI) 
needing mature Ml 3 pVUl major coat protein. Cloning 
ntibody V H or Vh-Cm1 regions into the Xhol/Spel site of 
1131X31 abuts these antibody sequences In frame with 
he amber stop-pseudo gVIIl sequence. Thus, 
U31X31L6-H should produce L6 Vk-ChI-pVTO fusion 
roduct when expressed In an £. coli amber suppressor 
train such as XL-1 and produce predominantly free L6 
'w-ChI protein when expressed In a nonsuppressor 
train such as MK30-3. The H and L chain vectors were 
hen recomblned into the single M13IXL604 expression 
hage by annealing HindlU/74 DNA polymerase-digested 
1131X12L6-L to M/uM/T4 DNA polymerase-digested 
! 1 3DC3 1 L6-H through the homologous regions found be- 
yeen the Htndm and MIul restriction sites contained In 
oth vectors. (30). Correct L6 H and L chain sequences 



were confirmed by DNA sequence analysis. The vector 
Ml 3DO£05, which contains a GGT codon coding for 
glycine In place of the amber stop found In M13IXL604, 
was constructed by site-directed mutagenesis of 
M13IXL604. Thus, M13IXL605 should produce L6 V«- 
ChI -pVITI fusion product irrespective of the bacterial host 
used for expression. 

Functional characterization and purification of 
M13IXL6Q4-expressed L6 Hob). M13DO604 phage ex- 
pressing L6 F(ab) was initially characterized by replicate 
filter lift assays (Fig. 2). XL-1 bacteria were infected at a 
low multiplicity of Infection resulting In low plaque den- 
sity of phage. No positive plaques were detected by filter 
lift assay when probed with alkaline phosphatase-con- 
jugated goat antibody to human AL chain (Fig. 2A). In 
contrast, numerous, superimposable, uniform positive 
plaques were detected .in assays of human it-chain (Fig. 
2B) and anti-Id 3, which recognizes assembled L6 H and 
L chain (Fig. 2C). Thus, screening by replicate filter lifts 
allows for detection of functional F(ab) expressed In this 
system. 

To further evaluate the functional integrity of L6 F(ab) 
expressed by M13IXL604. phage displaying L6 F(ab) on 
the phage surface were assayed for binding to a panel of 
distinct mouse monoclonal anti-Id raised against the 
mouse mAb L6. ELISA experiments shown in Figure 3 
demonstrated that four anti-Id antibodies specific for L6 




Flour* 3. Screening for L8 F(ab) reactivity by replicate filter lifts. 
M131XL604 phage wu replated at low density for screening. Replicate 
fitter lifts were prepared and probed with A) alkaline phosphataae-<on- 
Jugated goat antibody to human X -chain. B) alkaline phosphatase-conju- 
gated goat antibody to human « -chain, and C) an a -Id 3 antibody. The 
filter In C was washed and Incubated with alkaline phosphatase-conju* 
gated goat antibody to mouse Igr2a. Filters were then washed and devel- 
oped with alkaline phosphatase substrate reagent. 
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arc recognized by the fusion protein displayed on the 
M13IXL604 phage. In the EUSA format used. 
M13DCL604 phage were grown In the suppressor bacte- 
rial strain XL-1. and phage displaying incorporated L6 
F(ab)-pVIII protein are captured by solid phase anti-Id 
antibodies and detected with anti-M13 antibodies. Thus, 
only phage-bound L6 F(ab) is detected. The four anti-Id 
antibodies exhibit different binding specificities to L6 
(33). Anti-Id 1 and 7 bind to L6 L chain, 13 to H chain 
and 3 to assembled L and H chains. The binding of 
M131XL604 phage to an anti-«-chain antibody further 
confirmed the assembly of L6 L chain with L6 H chain- 
pVin fusion protein to yield F(ab), because L6 L chains 
are of the * type (Fig. 3). Ml 31X31. which contains no H 
chain-encoding region served as a negative control for 
each anti-Id. A representative curve shows no Ml 31X31 
binding. The ELISA results in Figure 3 thus indicate the 
presence of both L6 L and H chains and confirm that 
proper folding and assembly of L and H chains on the 
phage surface have been achieved. 

Purification of L6 F(ab) fragments produced by 
M13IXL604 phage grown in the nonsuppressor bacterial 
strain MK30-3 was accomplished by loading the crude 
periplasmie fraction prepared from a 1 -liter shake flask 
culture onto a cation exchange resin. The eluted material 
was concentrated and size fractionated by gel exclusion 
chromatography. Figure AA shows the elution profile 
from the gel exclusion column as assayed by ELISA for 
functional binding to anti-Id 3 using anti-human * to 
detect captured L6 F(ab). The purified F(ab) was shown 
to have a Af r of 42.7 kDa by SDS-PAGE, a value consistent 
with the 48-kDa size of known F(ab) fragments (Fig. 4B). 
Based therefore on both functional and physical proper- 
ties, the Identity of the purified material is chimeric L6 
F(ab). 

Regulated expression of M13IXL604 production of 
Ftab) and phage displayed Ftab). The proper activity of 
M13IXL604-expressed L6 F(ab) having been established, 
the functioning of the expression control system of 
M13IXL604 was tested. This involved comparing levels 
of fusion F(ab) with free F(ab) produced when phage were 
grown in suppressor strains (supE) vs nonsuppressor 
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Figure 3. ELISA characterization of L6 F(ab) produced by M 1 3IXL604. 
Various dilutions of phage expressing L€ F(ab) on the phage surface were 
evaluated for binding to anti-Id 1, 3, 7. and 13 and to an ana- human *- 
chain antibody. Bound phage were then detected as described In Mate- 
rials and Methods. Curves represent the average of triplicate assays. 
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Figure 4. Purification of L6 F(ab) expressed by M 1 3 1XL604- infected 
MK30-3 bacteria. A) elution profile of L6 F(ab) fractions assayed for 
binding to anti-Id 3 antibody by ELISA. Fractions elutlng at 8. 9. and 10 
oil were then pooled and concentrated. B) SDS-PAGE analysis of purified 
Fab. Samples from each purification step were analyzed by SDS-PAGE 
on a 10% nonredudng gel followed by staining with Coomassle brilliant 
blue. The L6 chimeric F(ab) standard was prepared from whole IgG by 
papain digestion followed by protein A chromatography (Pierce. Roc Word. 
IL). 

strains {supO) of bacteria, respectively. This comparison 
was made using ELISA assays employing anti-Id 3 to 
detect assembled L6 F(ab). F(ab) displayed on the surface 
of M 1 3 phage was detected using the M 1 3 assay described 
in Figure 2. M13IXL605, a variant construct of 
M131XL604 that has the amber stop codon replaced with 
a glycine encoding codon and therefore always synthe- 
sizes Vh-ChI-pVDI protein, served as a control. 

When phage were grown in the supE strain XL-1 , both 
M13IXL604 and M13IXL605 secreted phage displaying 
F(ab} into the culture medium (Fig. 5 A) and, as expected, 
little or no mature phage displaying F(ab) was detected in 
the periplastnic space (Fig. SB). In the supO strain 
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r lgure5. Characterization of the 
31XL604 expression control system, 
uld cultures of the amber sup pres- 
et rain XL-1 (supE) and the non- 
presaor strain MK30-3 {supO) were 
h Infected with either M131XLG04. 
HXL605. or M13TX31 and culture su- 
natant and perlplasmic fractions pre- 
ed as described In Materials and 
Aodj. Culture supernatant and perl- 
Hn from infected cultures were as- 
ed for functional binding to anti-Id 3 
Flab) displayed on phage surface [A to 
ind by total F(ab) (£ to H). 
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t30-3. the amber stop codon In M131XL604 should 
rvent fusion protein formation. Figure 5C shows phage 
playing F(ab) to be sharply reduced In M13IXL604 vs 
3DCL605. The low signal seen In the M13DCL604 cul- 
*e medium is presumably caused by nonspecific asso- 
tion of. L6 F(ab) with phage particles and/or trans la- 
nal readth rough. Again no mature phage displaying 
ib) was found in the periplasm (Fig. 5D). In examining 
; production of F(ab) by the anti-Id 3 A -chain assay 
th M13IXL604 and M131XL605 secreted antl-«-reac- 
c material Into the culture medium when expressed in 
her XL-1 or MK30-3 (Figs. 5E and 5G). Because the 



Dilution 

efficiency of amber suppression can be widely variable, 
we expect that M 1 31XL604 expressed In XL-1 should 
secrete free F(ab) In addition to F(ab) displayed on the 
surface of phage. Thus, the signal in Figure 5E should 
represent some combination of the two molecules. Poly- 
ethylene glycol precipitation of both M131XL604 and 
M131XL605 phage from Induced XL-1 culture superna- 
tants showed that approximately 95% of the anti-*-reac- 
tive material remained in the polyethylene glycol-cleared 
supernatant (data not shown) indicating that the predom- 
inant secreted product is free L6 F(ab). Consistent with 
the view that M13IXL604 and M13DCL605 secrete free 
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L6 F[ab). abundant amounts of soluble assembled F(ab) 
were detected In the periplasmic space In both strains of 
E. coll (Fig. 5. Fand H). MK30-3 appeared to consistency 
produce higher concentrations of F(ab) than XL- 1. Quan- 
titative ELISA demonstrated that the concentration of 
free F(ab) produced by M13IXL604 grown in the supO 
strain MK30-3 approaches values up to 0.3 mg/liter In 
culture supernatant and concentrations approaching 10 
mg/liter within the periplasmic space (data not shown). 

DISCUSSION 

M 1 31X3 1 (for H chains) and M131X12 (for L chains) are 
the M13 vectors that can receive polyclonal sets of V 
regions for constructing combinatorial libraries or indi- 
vidual antibody genes for V region production and/or 
mutagenesis (30). The system Is the first to allow differ- 
entially controllable production of free and fusion F(ab). 
Upon recomblnlng the two vectors, all of the control 
elements required for regulated expression of phage dis- 
played F(ab) or free F(ab) are contained in a single expres- 
sion vector. For phage display F(ab) is synthesized as a 
fusion product to pVm coat protein, the major structural 
protein of the filamentous Ml 3 phage particle. In the 
Ml 3 bacteriophage the major coat protein pVin is ex- 
pressed In several thousand copies per phage particle. 
Although this level of expression may be useful for pro- 
ducing F(ab) as F(ab)-pVIIl fusion proteins, pVIH synthe- 
sized solely as a fusion product Is not likely to form a 
properly assembled coat as a result of steric hindrance 
by the much larger H chain polypeptide. We reasoned 
that if wild-type pVm. In addition to F(ab)-pVm fusion 
product, is available for assembly then the formation of 
a mature infectious phage particle could occur. With this 
in mind, the H chain-gVIH expression vector Ml 31X31 
was constructed to contain two copies of gVIII. The copy 
that anchors F(ab) to the surface of Ml 3 has had gVIII 
codons extensively altered (pseudo gVIII) to prevent re- 
combination with the wild-type gVUl. 

To demonstrate the utility of these vectors we cloned 
the antibody V L -C*- and V H -C H 1 -encoding sequences from 
the chimeric L6 transfectoma cell line into the Ml 31X1 2 
and Ml 31X31 vectors, respectively. The two vectors were 
recombined to create M13IXL604. M13IXL604 expresses 
a dicistronic message encoding both H and L chain L6 
sequences under transcriptional control of an inducible 
Lac promoter. An amber stop codon resides between the 
H chain-encoding region and the pseudo gVIII coat pro- 
tein^ When grown in an amber suppressor strain (supE) 
of E. colt. M131XL604 was shown to produce L6 H chain- 
pVTlI fusion protein resulting in phage-displayed F(ab) In 
addition to free F{ab). When free F(ab) was desired as the 
major end product, the phage was grown In a nonsup- 
pressor strain [supO). The M13IXL605 vector, which 
solely produces F(ab)-pVW fusion protein, was con- 
structed to evaluate, by comparison, the M13IXL604 con- 
trol system. 

M131XL605 was found to display slightly higher levels 
of phage-assoclated F(ab) than M130CL604 when grown 
In the supE bacterial strain. This is likely attributed to a 
higher proportion of F(ab)-pVIII fusion protein relative to 
wild-type pVTU produced in the M131XL605- Infected 
strain. In both the supE and supO bacterial strains 
M13IXL604 secreted chimeric L6 F(ab) at levels some- 
what higher than those of M13IXL605. Phage titers of 
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M13IX31-. M13DCL604-. and M131XL605-infected cul- 
tures were found to decrease relative to the level of F(ab)- 
pVHI fusion protein incorporation (D. Yelton, unpub- 
lished observations). Taken together, these results sug- 
gest that a functionally viable phage particle may be able 
to tolerate a limited number of incorporated F(ab)-pVlII 
fusion products and that the amount of F(ab) incorpo- 
rated Into the phage coat may Inversely affect phage titers 
and overall F(ab) yield. Furthermore, the appearance of 
free F(ab) in the culture media suggests that H chain- 
pVm fusion protein unincorporated into phage is properly 
assembled with L chain and secreted as functional F(ab). 

The set of anti-Id antibodies raised against the murine 
m Ab L6 demonstrated that faithful expression and func- 
tion of bacterially produced chimeric L6 F(ab) occurs In 
the Ml 31X1 2 and M 1 31X3 1 vector system. L6 F(ab) activ- 
ity is readily detectable by either the nitrocellulose filter 
lift or ELISA formats. The anti-Id antibodies used In these 
studies serve as a convenient and Informative model for 
antibody-Ag binding, Inasmuch as the tumor Ag bound 
by L6 in vivo is yet to be purified or fully characterized. 

The Ml 31X31 and Ml 31X1 2 vector system described 
here can serve as a versatile, general purpose approach 
to F(ab) production and screening. F(ab) production In 
this system is sufficiently robust to permit multiple rep- 
licate filter lifts, a practical requirement for implement- 
ing various screening strategies. The vectors can be used 
to create combinatorial antibody libraries to Identify 
novel antibodies. Screening of sufficiently large combi- 
natorial antibody libraries could potentially allow useful 
antibody fragments of murine or human origin to be 
Isolated without the necessity to perform standard im- 
munization procedures (35). In addition, because ollgo- 
nucleotlde-directed mutagenesis Is convenient and highly 
efficient in M13 these vectors are ideal for engineering 
antibodies with new properties. It Is conceivable that the 
ability to create stable, high affinity human antibodies is 
a promising endeavor. 

In a companion publication (36). we show that the 
Ml 31X31 and M13DC12 antibody expression system 
lends itself to efficient antibody engineering by site-di- 
rected mutagenesis. Codon-based mutagenesis of L6 hy- 
pervariable regions proved effective In altering the fine 
specificity of L6 In a predefined manner. Thus the system 
allows mutagenesis, screening, and mutant F(ab) produc- 
tion to be accomplished very rapidly. 
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A PHAGE DISPLAY SYSTEM FOR DETECTION OF T CELL 
RECEPTOR-ANTIGEN INTERACTIONS 
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Abstract— The process of T cell recognition involves a complex set of interactions between the various 
components of the TCR/MHC peptide trimolecular complex. We have developed a system for 
exploring the specific binding interactions contributed by the constituent subunils of TCR complexes 
for components of their ligands. We utilized an M13 phage display system, designed for multivalent 
receptor display, to explore specific binding interactions between various TCRa chains and specific 
antigen in the absence of MHC. The multivalent TCR-phage display system was sensitive enough to 
reveal some TCRa chains capable of binding directly to antigen with the same fine specificity shown 
by the MHC-rcstricted T cells from which the a chains were derived. Cross-specificity analysis using 
two antigen-binding TCRa chains derived from T cells with different polypeptide antigen specificities 
confirmed the fidelity of this binding. In mixtures of antigen-binding and non-binding TCRa-dis- 
playing phage, specific selection was achieved at a starting frequency of 1/1000, suggesting that this 
system can be employed for selection and analysis of TCR-displaying phage libraries. While the 
binding specificities exhibited by these TCRs are unusual, they provide a novel perspective from 
which to study the specific binding interactions that constitute TCR antigen binding. 

Key words: M 13 filamentous phage, peptide binding, phage-display, TCR. 



INTRODUCTION 

Specific recognition by T lymphocytes is mediated by T 
cell antigen receptors (TCR) (Jorgensen et al y 1992b; 
Chien and Davis, 1993). The most extensively studied 
TCR-ligand interactions involve TCRa/? heterodimers 
binding to pcptide-MHC complexes. A number of stud- 
ies have focused on defining the intricate biochemical 
interactions between the various components of the TCR 
and MHC-peptide complexes. As a consequence of these 
investigations, it has been suggested that residues within 
the CDR3 regions of TCRa and fi chains interact with 
specific residues from the antigenic peptide fragment 
(Davis and Bjorkman, 1988; Engel and Hedrick, 1988; 
Danska et a/., 1 990; Jorgensen et a/., 1 992a). More recent 
studies have indicated that TCR interactions with peptide 
appear to be very specific, while interactions with MHC 
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are comparatively degenerate (Ehrich et a/., 1993). The 
TCR-MHC interactions could be established in a variety 
of configurations with the same TCR and MHC and 
appear to be influenced by the interaction of the TCR 
with the peptide. Thus, while TCR interactions with resi- 
dues from both peptide (Engel and Hedrick, 1988; Dan- 
ska et a/., 1990; Jorgensen et al. f 1992a) and MHC 
(Ajitkumar et al., 1988; Peccoud et at., 1990) appear to 
be essential for efficient T cell recognition, the dominant 
interactions are seemingly mediated by CDR3 residues 
associated with specific peptide (Ehrich et a!., 1993). 

Although most studies have focused on conventional 
TCR binding of MHC-peptide ligands, a number of 
alternative TCR-ligand interactions have emerged, pro- 
viding the basis for further studies in TCR specificity. 
Examples include TCR P chain-binding to superantigens 
(White et al, 1989; Gascoigne and Ames, 1991; White 
et a/., 1993); TCRs which specifically bind peptide-free 
mycolic acid antigens in the context of CDI restricting 
elements (Porcelli et al. y 1992); and TCRs with speci- 
ficities for carbohydrate moieties of post-translationally 
modified peptide antigens (Haurum et a/., 1994; 
Michaelssun eraL, 1994). These recent studies of non- 
conventional TCR-ligand interactions lend credence to 
earlier studies of T cells which expressed a/f TCRs capable 
of binding directly to hapten-conjugates (Rao et 



i 



13S8 



T. ONDA et aL 



1984a, 1984b; Siliciano et aL, 1986) or peptide antigens 
(Esch and Thomas, 1990) in the absence of MHC 
proteins. Interestingly, direct TCR-binding interactions 
with hapten or peptide ligands, in the absence of MHC, 
demonstrated relatively high affinity constants (Ki>s = 5 
x 10~*-6 x 10~ 6 ) when compared to the normal range 
of TCR-ligand affinities (Rao et a/., 1984b; Siliciano et 
aL, 1986; Esch and Thomas, 1990; Davis and Chien, 
1993). 

Studies of TCR that directly bind nitrophenol haptens 
suggested that the specificity was mediated by the TCRa 
subunit in a manner independent of TCR0 (Kuchroo et 
aL, 1991). Similar studies by our group and others have 
suggested the possibility that a subset of TCRa molecules 
may bind directly to antigen for which the T ceH encoding 
(he TCR protein is specific (Bissonnctte et aL, 1991; 
Greene/ a/., 1991; Mori et aL, 1993). While these unusual 0 
TCR-ligand interactions may represent a relatively small 
subset of TCR specificities, they are nonetheless 
intriguing because they provide an alternative approach 
with which to explore the many complex interactions that 
mediate specific binding. Since the constituent at and 0 
chains of TCR may bind different residues of the peptide 
ligand component in a relatively autonomous manner 
(Jorgensen et aL, 1992a, 1992b), we reasoned that it 
should be feasible to study directly the specific inter- 
actions of TCRa chains with antigenic peptide. An 
elemental analysis of the specific interactions that con- 
stitute the complex binding of TCR with MHC-peptide 
ligands would be extremely useful. While these studies 
would not refute existing theories of conventional TCR- 
antigen binding, they would provide a unique perspective 
from which to study the individual specific binding inter- 
actions between the various subunits of the TCR- 
MHC/peptide complex which contribute to the overall 
TCR binding specificity. 

We employed the filamentous phage display system for 
the study of immunological receptors (Kang et aL, 1991; 
Barbas and Lerner, 1991b) to directly assess specific bind- 
ing interactions of several TCRa polypeptides with anti- 
genic peptide and globular proteins in the absence of 
MHC. The multivalent phage display system, utilized to 
enhance avidity, was optimally suited to study inter- 
actions of TCR molecules directly with their peptide 
ligand, as the binding affinities are relatively low com- 
pared to immunoglobulin receptors. 



EXPERIMENTAL PROCEDURES 

Reagents, strains, vectors 

The Escherichia coli XLl-Blue (tef) and VCSM13 
helper phage (kan r ) were purchased from St rata gene. AH 
enzymes were purchased from Boehringer Mannheim 
and Promega. The phagemid vector pComWJ (Kang et 
aL, 1991) utilized to produce clones encoding TCRa 
chains fused to the cpVIII of M13 filamentous phage 
was generously provided by Denise Burton (The Scripps 
Research Institute). AH molecular biology procedures 
were performed according to conventional techniques as 



described in Molecular Cloning: a Laboratory Manual 
(Sambrook et aL, 1989) unless indicated otherwise. 

Peptides and antibodies 

Poly 1 8 peptides were kindly provided by Dr Bhagirath 
Singh (University of Western Ontario, Canada). Bee 
venom phosphobpase A 2 and bovine phospholipase A 2 
were purchased from Sigma. Cell lines producing hamster 
anti-mouse TCRa (H28-710.16) and anti-TCR/? (H57- 
597 hamster IgG) were kindly provided by Dr Ralph 
Kubo (Cytel, La Jolla, CA, U.S.A.). Supernatant* were 
concentrated by precipitation with 45% saturated 
ammonium chloride followed by dialysis, and the anti- 
bodies were purified by protein A chromatography. 

The TCRa cDNA and clones 

Full-length cDN A sequences encoding the A 1 . 1 TCRa 
(Green et aL, 1991). 5C.C7 TCRa (Fink et aL, 1986) 
(provided by S. Hedrick, University of California, San 
Diego, CA, U.S.A.) and the 3B3 TCRa (Mori et aL, 1993) 
(T. Mikayama, unpublished sequence), respectively, were 
used as templates in the polymerase chain reaction (PCR) 
subcloning procedures. Fifty nanograms of cDN A tem- 
plate was mixed with 60 pmol of each primer (Table 1), 
200 mM dNTP, Promega Taq polymerase buffer con- 
taining 1.5 mM MgCl 2 and five units of Taq polymerase 
in a final volume of 100 ml. Amplification was carried out 
for 20-30 cycles on a TwinBlock thermal cycler (Ericomp 
Inc., San Diego, CA, U.S.A.) under the following con- 
ditions: I min of denaturation at 94°C, 2 min of primer 
annealing at 50°C, I min of elongation at 72°C, followed 
by a final elongation at 72°C for 15 min. Amplified prod- 
ucts were hydrolysed (4 hr at 37°C) at the Xhoi and Xbal 
sites (25 U enzyme/jig PCR fragment) encoded by the 
primers, size-fractionated by agarose gel electrophoresis 
and purified using a Geneclean procedure (Bio 101 Inc., 
La Jolla, CA, U.S.A.). The PCR primers were designed 
to produce TCRa cDNA sequences that could be ligated 
into the pCombS vector in frame with the pelB leader 
sequence at the 5' end and fused to the N-terminus of the 
cpVIII protein on the 3' end. The purified Xhol - Xbal 
insert was directionally cloned into the phagemid vector 
pComb8 at the Xhol and Spel sites and trans formants 
were screened with internal oligonucleotide probes in 
colony lifts. Plasmid DNA from positive clones were 
sequenced by the dideoxy method using Scquenase 2.0 
(USB Corp., Cleveland, OH, U.S.A.), analysed using 
the MacVector analysis program (IBI, New Haven, CT, 
U.S.A.) confirming identity, orientation and in-frame 
cloning of TCRa cDNAs. 

Recombinant phage preparation 

Production of recombinant filamentous phage dis- 
playing TCRa chains was carried out essentially as pre- 
viously described for the production of phage displaying 
immunoglobulin receptors (Kange/ a/., 1991; Barbas and 
Lerner, 1991b). Briefly, E. coli XLl-Blue (Stratagene) 
cells were transformed with recombinant TCRa/pComb8 
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Variable region primers 

TCR Va (sense) 
5C.C7 

TCR V« (sense) 
3B3 

TCR V* (sense) 



Constant region primers 

TCR-VJC 

TCR Cat (antisense) 

TCR-VJ 

Truncated Car (antisense) 
Restriction sites used 



Al 



V-TtAAGAG CTCGAGA TGAAATCC TGAGT-3' 

S'-TGGACAOCQAQATGCAGAGGAACC 

5'<KXKXGCTtX5AGATGAAATCCTTGAGTXjTTTT 
GTGGTCCTGTGGCTCCAGTTAAACTGCGTG AGG A GC- 
CAGCAGCAAGTGCAGCAGAGTCCTGCA-3' 

5-CCTGTCiaAG^^ 
5-CAGGAGli3AGAGCCACC^ 



phagemid and selected on LB plates containing 100 pg/ml 
ampiciUin. Fresh ampicillin-resistant colonies were 
grown in liquid culture on a shaker at 37°C in super- 
broth (SB) for 1-2 hr in the presence of ampicillin (50 
/ig/ml) to select for cells bearing phagemid and tetra- 
cycline (10 //g/ml) to induce the F' episome. The cultures 
were then incubated with 1 raM isopropyl /J-r>thio- 
galactoside (IPTG) for 1 hr to induce production of the 
TCR-cpVIII fusion protein and then superinfected with 
VCSM13 helper phage (10 12 pfu) (Stratagenc) to give a 
final phage/cell ratio of 10-20:1. The cells were shaken 
(37°C) for an additional 2 hr and then selected for 
helper phage induced antibiotic resistance (kanamycin, 
70 /ig/ml) and grown overnight, 37'C, 250 rpm. Phage 
supernatant was cleared by centrifugation of the cultures 
(4000 rpm in a GSA rotor, Sorvall, at 4°Q. The phage 
were precipitated by adding 3% (w/v) NaCI and 4% (w/v) 
polyethylene glycol 8000 for 1 hr at 4*C and centrifuged 
at 9000 rpm in a Sorvall GSA rotor at 4°C. The phage 
pellets were resuspended in PBS to 1/25 of the original 
volume and aggregated phage and debris were removed 
by centrifugation for 5 min in a bench top micro- 
centrifuge. Phage supernatants were transferred to fresh 
tubes and stored in aliquots at — 20 3 C. 

Affinity selection panning 

The panning procedure utilized to screen binding of 
TCR displayed on the surface of filamentous phage was 
a modification of the original protocol described by 
Parmiey and Smith (1988). Wells of a micro tit ration 
ELISA plate (Immulon 2. Dynatech) were coated with 
peptide or phospholipase A 2 (Sigma) in sodium bicar- 
bonate (0.1 M, pH 9.5) at 4°C overnight in a volume of 
50 y\ at the concentrations indicated in the figure legends. 
Antibodies were. coated onto plates using Tris (50 mM, 
pH 9) at 4°C overnight in a volume of 50 /J at the 
concentrations indicated in the figure legends. The wells 
were washed twice with PBS (Dulbccco's, pH 7.4) and 
blocked by completely filling the well with 3% bovine 
scrum albumin (BSA) (Sigma, fraction V) and incubated 
for I hr at 37°C The blocking solution was flicked out, 



rinsed with PBS and 50 /il recombinant phage was added 
to each well (typically I0 lo -10" CFU unless otherwise 
indicated) and incubated for 2 hr at 37°C. The phage 
were then removed and the wells were washed 10 x (pip- 
etting up and down to wash) with TBS/Tween (50 mM 
Tris-HCI, pH 7.5, 150 mM NaCI, 0.05% Twecn 20) over 
a period of 1 hr allowing the wells to remain filled between 
washings. Washing was done carefully to ensure simi- 
larity between samples, experiments and individuals. 
Adherent phages were eluled with 50 /il elution buffer 
(0.1 M HQ adjusted to Ph 2.2 with solid glycine, I mg/ml 
BSA) for 10 min at room temperature and neutralized 
with 6 ml of 1 M Tris base. El u ted phage were used to 
infect fresh £. coli XL 1 -Blue cells, OD«o = 1 (grown in 
the presence of 1 0 /ig/ml tetracycline to induce F' episome 
expression) for 15 min at 37°C followed by selection 
on LB/ampicillin plates to assess the number of bound 
recombinant phage per ml of el u ted phage. 

Immunoblot of TCRa-p VIII fusion proteins 

Phagemid DNA of TCR-pComb8 or control con- 
structs were transformed in XLl-Blue cells and trans- 
form ants were inoculated in. 25 ml 2 xTY medium 
containing 50 /xg/ml ampicillin. Protein expression was 
induced with 0.1 mM IPTG at an OD«o = 1 Tor 7 hr at 
26°C Cells were harvested by centrifugation. sonicated 
on ice for 1 min in PBS, 1 mM PMSF, 1% NP-40 and 
the resulting lysate was centrifuged (13,000g, 4 = C) to 
remove insoluble debris. The soluble cellular proteins 
were analyzed by SDS-PAGE and immunoblotted with 
an anti-TCRat antibody (H28-710.16). 

Presentation assay 

The TCR specificity of A 1.1 hybridoma cells for our 
poly 18-rclated peptide analogues was assessed by mea- 
suring the 1L-2 responses of A 1 . 1 cells to peptide-pulsed 
BALB/c (I-A d ) spleen cells (y-irradiated 2000 rad). Super- 
natants were collected following overnight peptide stimu- 
lation and the relative production of IL-2 was assessed 
in a CTLL assay. Starved CTLL cells were washed and 
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incubated in various dilutions of sample or control super* 
natants (100 /il) in 96-well plates (Costar) at 37°C for 
16 hr. The cells were then pulsed with [ 3 H>thymidinc 
(1 /iCi/well) and harvested an additional 4-6 hr later. The 
IL-2-dependent proliferation was assessed by the relative 
level of [ 3 H}-thymidine incorporation (cpm). 

Hybridization of replica plates 

Following two rounds of affinity selection panning 
(described above), ampicillin-resistant CFUs were blot- 
led onto replica 6lters as described (Sam brook et aL, 
1989). The replica filters were denatured (0.5 N NaOH, 
1.5 M NaCl) and neutralized [\.S M NaCl, 0.5 M Tris- 
HCL (pH 7.4)] and then fixed (80°C, 2 hr). The filters 
were then hybridized with a 5' ^-labelled A 1 . 1 TCR Vat 
specific probe (5'-GAAGAGCTCGAGATGAAATC- 
CTTGAGT-3'). 
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RESULTS 
The TCR phage display 

We previously presented evidence suggesting that the 
TCRot molecule from the Al.l T cell hybridoma (V* 1.2, 
JaTA65) may bind directly to antigenic peptides for 
which this T cell is specific (Bissonnette et aL, 199 1 ; Green 
et aL, 1991). The M13 filamentous phage display system 
(Kang et aL, 1991; Barbas et aL % 1991a; Barbas and 
Lerner, 1991b) provided the means to directly assess the 
ability of Al.l TCRot polypeptides to specifically bind 
to peptide antigens. We utilized the phagemid vector, 
pComb8 (Kang et al., 1991) to generate recombinant 
phagc-displaying TCR proteins fused to the N-terminus 
of the coat protein VIII (cpVIH). The cpVUI molecules 
form the capsid coat during phage assembly (utilizing 
about 2500 cpVIH proteins per phage particle) allowing 
for multiple receptors to be displayed on the surface of 
the recombinant phage (Feud et al. f 1991 ; Greenwood et 
al., 1991; Kang et al., 1991). This multivalent expression 
can greatly enhance the avidity, which facilitates the 
study of binding interactions with moderate affinity. 

The PCR-generated fragments encoding the VJ- or 
VJC-domains of the Al.l TCRot molecule were subcloned 
into the pComb8 phagemid vector to generate recom- 
binant phage displaying multiple copies of the Al.l 
TCRa molecule on the surface. This is illustrated in Fig. 
1 (see Experimental procedures). Extracts of cells trans- 
formed with various recombinant or control phagemids 
were subjected to SDS-PAGE and immunoblotted with 
an anti-TCR Cot specific antibody (H28.7I0.16) to con- 
firm that TCRot/cpVTII fusion proteins were produced. 
As shown in Fig. 2A, a protein that migrated with the 
predicted molecular weight for the TCR-VJC/cpVIII 
fusion protein, Mr 36.5 kDa, was detected from extracts 
of cells transformed with the Al . 1 TCR-VJCa construct. 
A second protein of M r 34.5 kDa was also detected, which 
probably represents a partially degraded form of the 
fusion protein, since it was not observed in the other 
lanes. No bands were observed from extracts of cells 
transformed with either the parental phagemid (pComb8, 
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Fig. 1. Schematic protocol for die generation of chimeric con- 
structs. (A) Full-length cDNA sequences encoding various 
TCRot chains were used as templates in the. polymerase chain 
reaction (PCR) subcloning procedures. The 5' PCR primers 
(arrows) contained an Xhoi site designed to generate a PCR 
fragment that could be H gated into the pCombS vector in frame 
with the pelB leader sequence. The 3' primers (arrows) con- 
tained a sequence encoding a flexible linker (GOGS) and an 
Xba\ site to generate a fusion gene with the major cpVIII upon 
ligation. The PCR fragments were digested with Xbal and Xhoh 
purified and direcuonally cloned into the phagemid vector 
pCombS. (B) The recombinant TCRo-pCombS phagemids 
contained an amp' gene utilized to facilitate drug selection of 
recombinant phage. Production of the recombinant 
TCRa/cpVHI fusion protein was driven by the inducible LacZ 
promoter following transformation of competent XL 1 -Blue 
cells (Stratagene). Transport into the pcriplasmic space during 
production was mediated by the pelB leader sequence. (C) The 
predicted structure of the recombinant TCRa/cpVIU fusion 
proteins consisted of a TCRa polypeptide and a flexible linker 
(GGGS) fused to the N-terminus of the cpVIII molecule. The 
pelB leader sequence at the N-terminal end is cleaved off fol- 
lowing transport into the periplastnic space. The recombinant 
fusion protein was incorporated into the viral capsid coat dur- 
ing phage assembly to generate recombinant M13 phage dis- 
playing TCR molecules on the surface. 

vector only) or the A 1 . 1 TCR VJa construct, which both 
lack the Ca domain recognized by the H28.710.16 anti- 
body used for detection. 
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Fig. 2. Expression of TCRa/pComb8 recombinant fusion 
proteins. (A) Western blot analysis to detect expression of Al. I- 
TCR-VJCa/cpVHI fusion proteins in E. colt Lysales from 
recombinant TCRa-pComb8 (lane 1: Al.l-TCR VJ; lane 2: 
Al.I-TCR-VJQ or parental pComb8 Qunt 3: vector only) 
phagemid transformed XL 1 -Blue cells were subjected to SDS- 
PAGE and immunoblotted with an anti-TCR Cot antibody 
(H28.710.I6). The position of the band migrating at the pre- 
dicted molecular weight of the TCR-VJCa/cpVIIl fusion pro- 
tein (M, 36.5 kD) is indicated by the arrow. (B) Display of 
TCR-VJCa polypeptides on the surface of recombinant phage. 
Binding of recombinant phage to anti-TCR antibody was uti- 
lized to detect the presence of TCR on the phage surface. 
Recombinant TCR<x-pComb8 phage or parental pCombS 
phage (indicated in the figure) were panned on plates coated 
with antibodies specific for TCR Cat (H28.710. 16, solid bars) or 
TCR V06 (H57-597, hatched bars). The number of bound 
phage was assessed by counting the number of etuted amptcillin 
resistant CFU following extensive washing as described in the 
experimental procedures. 

The presence of TCRa molecules displayed oh the sur- 
face of recombinant phage was assessed using anti-TCR 
antibodies. Recombinant phage were "panned" on plas- 
tic dishes (Parmley and Smith, 1988) coated with either 
anti-TCR Ox (H28-710.16) or anti-TCR Cp (H57-597) 
antibodies (Fig. 2B). Following extensive washing, the 
bound phage were eluted with a low pH buffer. Sub- 
sequent infection of host cells with eluted phage was 
readily achievable as the cpIII proteins, utilized for 
adsorption to the F' episome and infection of E. coli> 
remain intact following acid elution. Since the pCombS 
phagemid vector contains an amp* gene, the number of 
bound phage could be assessed by counting the number 
of ampicillin-resistant CFU following infection of host 
cells with eluted phage. In addition to recombinant phage 
displaying Al . 1 TCRa chains, we also constructed phage 
displaying the TCRa chains from the 5C.C7 T cell hybrid- 
oma (specific for pigeon cytochrome C peptide restricted 



to I-E k ) (Fink et a/., 1986). As shown in Fig. 2B, both 
the A 1.1 TCR-VJCa as well as the 5C.C7-TCR-VJCa 
recombinant phage were readily bound by the anti-TCRa 
but not the anti-TCR/f antibodies. As predicted, the par- 
ental pComb8 phage (vector only) and recombinant 
TCR-VJa phage showed no binding to either antibody 
as these phage lack the TCR Cot determinant recognized 
by H28.710.16 antibody. Thus, polypeptides bearing 
TCR Ca specific determinants were effectively displayed 
on the surface of recombinant phage particles, indicating 
that TCRa/cpVIIl fusion proteins could be effectively 
incorporated into the capsid coat during phage assembly. 

Specificity of TCR displayed on- recombinant phage 

The Al .1 T hybridoma cells recognize a synthetic poly- 
peptide with the sequence poly [EYK(EYA) 5 ] (poly 18) 
presented by I-A d (Fotedar et «/., 1985). The antigenic 
fine specificity of A 1 . 1 cells for a series of poly 1 8-related 
peptide analogues was assessed by measuring the 1L-2 
responses to peptide pulsed BALB/c (I-A d ) spleen cells. 
As shown in Fig. 3 A, A 1.1 cells produced cytokine in 
response to two of the peptides, EYK(EYA) 4 and EYK- 
(EYA) 4 EYK. Peptide analogues, substituted with ala- 
nines at residues 3 or 10, failed to stimulate A 1.1 cells. 
These results are in agreement with those reported by 
others for this cell line (Fotedar et al.. 1985). The anti- 
genic fine specificity of the A 1.1 hybridoma cells for the 
four peptides described above provided a model with 
which to compare specificity of TCRa molecules dis- 
played on recombinant phage. 

The poly 1 8-related peptide analogues utilized to 
characterize the specificity of A 1.1 hybridomas were 
coated onto 96-wcll plates for panning experiments to 
assess the binding capacity and specificity of the TCRa 
molecules displayed on the phage surfaces. Bound phage 
were extensively washed, eluted and then quantitated by 
counting the number of ampicillin-resistant CFUs fol- 
lowing infection of host cells. Representative of a series of 
10 binding experiments, Fig. 3B shows that recombinant 
phage expressing AI.l TCR-VJCa preferentially bound 
to the antigenic peptides, EYKEYAEYAEYAEYAEYK 
and EYKEYAEYAEYAEYA, but not to non-antigenic 
peptides with an alanine substitution at residues 3 (EYK- 
EYAEYAEYAEYA) or 10 (EYKEYAEYAEYAEYA). 
The parental pComb8 phage (vector only) and recom- 
binant phage displaying 5C.C7-TCR-VJCa showed no 
specific binding to any of the four poly 18 peptide ana- 
logues (Figs 3B and 4A). Recombinant phages displaying 
AI.l TCR VJa, which lack the Ca-domain, also showed 
preferential binding to the antigenic peptides, indicating 
that the specific binding was mediated by the TCR Va- 
and Jot-region domains (Fig. 4B). The observed pref- 
erential binding to antigenic peptide was reproducible 
and consistent over a wide range of phage tit res when 
we collected the quantitative data from seven different 
experiments utilizing several different recombinant phage 
preparations from two different recombinant constructs 
(Fig. 4C). Thus, direct binding by phage displaying AI.l 
TCRa to peptides reflects the antigenic specificity of the 
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Fig. 3. Antigenic fine specificity of A 1.1 hybridoma cells and 
Al.l-TCRa Recombinant phage. (A) Fine specificity of Al.l 
hybridoma cells. The ability of poly 18 related peptide ana- 
logues to induce an IL-2 response was utilized to assess the fine 
specificity of Al.l hybridoma cells. The BALB/c spleen cells (1- 
A*) were pulsed with the indicated poly 18 related pepudc 
analogues and co-cultured with Al.l hybridoma cells as 
described in the experimental procedures. The relative amount 
of IL-2 produced in response to the peptide antigens is indicated 
in the K-axis as the amount of tritialed thymidine uptake in a 
CTLL assay. (B) Comparative fine specificity of peptide binding 
by recombinant Al.l-TCR phage. The binding capacity of 
recombinant phage displaying Al.l-TCR-VJCar was assessed 
using the same four poly 18 related peptide analogues used to 
assess the fine specificity of the Al.l hybridoma cells. Plates 
were coated with the indicated peptide analogues followed by 
panning with recombinant phage displaying Al.l TCR-VJGx. 
The number of bound phage following extensive washing was 
assessed by determining the number of eluted ampicillin resist- 
ant CFU. 



Al.l hybridoma from which the TCRa cDNA sequence 
. was derived (see Fig. 3 A), despite the absence of class II 
MHC presentation. 

Affinity selection of specific TCRcl 

To assess the capacity for affinity selection of phage 
displaying specific TCR polypeptides, recombinant 
phage displaying Al.l TCR-VJGx were mixed 1:100 or 
1:1000 with phage displaying 5C.C7-TCR-VJCa and 
then subjected to two rounds of affinity enrichment on 
plates coated with antigenic poly 1 8 peptide analogues. 
The eluted phage were used to infect host cells and col- 
onies from ampicillin plates were lifted onto nitro- 
cellulose filters. The colony-lifts were lysed and 
hybridized with a DN A probe specific for the A 1 . 1 TCRa 



sequence. We found that 85% (468/554) of the colonies 
from the 1:100 mix and 76% (1 13/149) of the colonies 
from the 1:1000 mix contained Al.l TCRa specific 
sequences (Fig. 5). Thus, two rounds of selection enriched 
the phage displaying Al.l TCRa by over 500- and 1000- 
fold, respectively. These results suggest that the phage- 
display system can be utilized to select for T cell receptors 
capable of binding directly to peptide ligands and has 
potential to be utilized as a gene identifying system for 
TCRa chains with relatively high affinity for their peptide 
ligand. 

The TCRa binding directly to protein antigen 

While phage displaying 5C.C7-TCR-VJCa railed to 
bind to the poly 18 peptide analogues bound by Al.l 
TCRa recombinant phage, these phage also failed to bind 
antigenic cytochrome C peptide, in repeated attempts 
(data not shown). Thus, differences in detectable binding 
of TCR Vat to specific antigenic peptides may represent 
variance in the particular TCR-Iigand interactions such 
that only a subset of TCR, including that on Al.l cells, 
have direct interactions strong enough to be detectable 
in this system. Accordingly, we constructed recombinant 
phage displaying TCRa from the 3B3 T cell hybridoma, 
which has previously been suggested to express a TCRa 
chain product capable of direct interaction with antigen 
(Mori et al. t 1993). The 3B3 hybridoma was generated 
from BALB/c mice immunized with bee venom phos- 
pholipase A 2 (PLAJ and is specific for a bee venom 
PLA 2 derived peptide (residues" 19-34) in the context of 
I-A d (Mori et aL v 1993). Two aspects of 3B3 TCR binding 
characteristics made it particularly interesting for phage 
display studies of direct antigen-binding TCRa chains. 
Firstly, three-dimensional structural analysis of bee 
venom PLA 2 indicated that the antigenic peptide residues 
associated with 3B3 TCR binding are exposed to the 
outer surface of the PLA 2 molecule in the form of a 
flexible loop (Scott et a/., 1990). Secondly, previous pep- 
tide competition studies of 3B3 TCR specificity suggested 
that the TCRa chain may be able to bind directly to 
unprocessed bee venom PLA 2 (Mori et a/. f 1993). There- 
fore, we constructed recombinant phage displaying 3B3- 
TCR-VJCa to test the ability of this TCRa chain to 
directly bind to unprocessed bee venom PLA> protein in 
a specific manner. 

The PCR-generated fragments encoding the 3B3- 
TCR-VJCa polypeptide were subcloned into the 
pComb8 vector for production of recombinant phage. 
Bee venom PLA 2 was coated onto 96-well plates for pan- 
ning experiments to assess the binding capacity and speci- 
ficity of the 3B3 TCRa chains displayed on the phage 
surface. We observed binding of recombinant 3B3-TCR- 
VJCa phage, but not parental pComb8 phage (vector 
only), to bee venom PLA 2 coated wells (Fig. 6A). Fur- 
thermore, binding of recombinant 3B3-TCR-VJCa 
phage to BSA (3% BSA, used for blocking) was not 
observed. Moreover, the recombinant phage displaying 
3B3 TCR-VJCa did not bind to bovine PLA 2 (Fig. 6B), 
which lacks the exposed hydrophilic loop containing the 
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Fi* 4. Antigenic binding is mediated by the V j regions of specific TCR-recombinant phage (A) The 
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protocol described in the experimental procedures. Plates were coated with the indicated peptide 
analogues and panned with the indicated phage. The number of bound phage following extensive 
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are included for comparison. 



antigenic peptide sequence associated with 3B3 TCR 
binding (Dijkstra et al y 1981; Mori et a/., 1993). The 
observed preferential binding to antigenic bee venom 
PLA 2 was reproducible and consistent over a wide range 
of phage titres when we collected the quantitative data 
from five different experiments utilizing several different 
recombinant phage preparations (Fig. 6C). Thus, the 
phage display system enabled us to assess the novel 
Capacity of a TCRa chain to bind directly to antigenic 
protein in the absence of MHC 



In order to further assess the specificity of the TCRa 
recombinant phages for their respective antigenic ligands, 
we did parallel panning experiments with the A 1 . 1 TCR- 
VJCa and 3B3 TCR-VJGx recombinant phage (Fig. 7). 
Phage displaying AI.I TCR-VJCa showed binding to 
the antigenic poly 18 peptide analogue (EYK- 
EYAEYAEYAEYAEYK) but not to bee venom PLA 2 . 
In contrast, phage displaying 3B3 TCR-VJCa dem- 
onstrated specific binding to bee venom PLA 2 but not to 
the poly 18 peptide analogue. Thus, the TCR phage- 
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Fig. 5. Affinity selection of specific TCR on recombinant phage. 
A mixing experiment was done to assess the capacity for affinity 
selection of peptide specific TCR displayed on the surface of 
recombinant Al.l-TCRfl-pComb8 phage. The Al.l-TCR- 
VJGx phages were mixed 1:100 or 1:1000 with 5C.C7-TCR- 
VJCa phage and subjected to two rounds of affinity selection 
by panning as described in the experimental procedures. Colony 
lifts were hybridized with a radiolabeled Al.l-TCRat specific 
oligonucleotide probe. Autoradiographs of the hybridized rep- 
lica filters are shown in panels A-D. Autoradiographs of control 
colony lifts from plates inoculated with unmixed 5C.C7 TCRoc 
recombinant phage (panel A) or unmixed Al.l-TCRa recom- 
binant phage (panel B) are shown for comparison with plates 
from mixing experiments starting with Al.la/5C.C7ct ratios of 
1:100 (panel C) or 1:1000 (panel D) prior to the two rounds of 
affinity selection. 



display system described here facilitated the study of 
TCRct chains capable of binding directly to antigenic 
epitopes in a specific manner. 

DISCUSSION 

The phage-display system (Smith, 1985) has proved to 
be extremely useful for the study of specific receptor- 
ligand interactions (Winter et a/., 1994). The advantage 
of this system for immunological studies is that the dis- 
play of receptors on recombinant phage provides a means 
to link directly antigen recognition structures and the 
genetic instructions encoding the receptor (Kang et a/., 
1991). Consequently, the phage display system has been 
invaluable for the characterization and selection of anti- 
body specificities for antigenic ligands. Here, we have 
shown thai this system can be utilized further to study 
specific binding interactions of constituent TCR poly- 
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Fig. 6. Antigenic specificity of recombinant phage displaying 
TCR* polypeptide from 3B3 hybridoma cells." (A) The binding 
capacity of parental pCombS phage or recombinant phage dis- 
playing TCR-VJCa from the 3B3 hybridoma cell to bee venom 
PLA 2 was assessed using the panning protocol described in the 
experimental procedures. Plates were coated with the indicated 
concentration of bee venom PLA 2 and then panned with 10* 
CFU of recombinant phage displaying 3B3-TCR-VJCa or par- 
ental pCombS phage. The number of bound phage following 
extensive washing was assessed by determining' the number of 
ampicillin resistant CFU/pJ. (B) The specificity of recombinant 
phage displaying TCR-VJCa from the 3B3 hybridoma cell for 
bee venom PLA 3 and bovine PLA 2 was assessed using the pan- 
ning protocol. The number of bound phage following panning 
with 10 10 CFU of recombinant phage and extensive washing 
was by determining the number of ampicillin resistant 

CFU/jd. (C) The relative binding specificity of several recom- 
binant phage preparations, displaying 3B3-TCR-VJG* for bee 
venom PLA 2 or bovine PLA 2 arc compared from five different 
panning experiments as described above. 
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peptides to antigenic ligands. Specifically, the phage-dis- 
play system facilitated a novel demonstration of direct 
binding by TCRa chains to peptide or protein antigens 
in a specific manner. These results indicate that for some 
TCRa chains the binding affinity to antigenic peptide 
may be sufficiently high that it can be detected in the 
absence of the TCR0 chain, and in an MHC-independent 
manner. It has been postulated previously that TCR 
binding to peptide by Va- and V/J-subunits may be rela- 
tively autonomous and that the resulting TCR-peptide 
interactions can dramatically influence TCR-MHC inter- 
actions; indicating the "primacy" of TCR-peptide inter- 
actions (Ehrich et a/., 1993). Our results extend these 
findings by demonstrating that the dominant interactions 
of certain TCRa chains for peptide antigens may be 
sufficiently high that they can be analysed independently. 
However, these interactions are quite unusual in that they 
do not require the expression of the second TCR subunit 
or normal MHC and coreceptor interactions. These 
results raise the concern that this model docs not reflect 
typical TCR-ligand interactions. Indeed, gene transfer 
studies of A 1.1 TCRa chain sequences indicated that the 
expression of the a chain alone was not sufficient to confer 
responsiveness to peptide antigen alone or in the context 
of I-A d (H. Zheng, A. Fotedar and D.R. Green; unpub- 
lished data). Furthermore, activation of A LI hybridoma 
cells required specific peptide antigen to be presented in 
the context of the appropriate MHC (I-A d ) (Fotedar et 
a!., 1985). The requirement for MHC presentation and 
natural TCRa/J conformation for functional interaction 
of cell surface expressed receptors may be essential for 
bringing the respective binding sites into close enough 
proximity to establish the specific binding interaction 
between the residues of opposite charge which mediate 
binding. The specificity data presented here (Fig. 3 and 
Fig. 4 and Fig. 7) suggest that these requirements may 
not be necessary for some receptors to detect specific 



binding in non-cellular TCR phage-display systems. 
Therefore, the data presented here do not suggest that 
the TCRa chain alone would be sufficient for T cell rec- 
ognition and activation, but rather that the TCR phage- 
display system provides a means to explore the specific 
binding interactions that mediate TCR binding to antigen 
at a constituent level. While this study does not refute 
commonly held theories of TCR-antigen interactions, 
this system does provide a means to explore the innate 
specific binding interactions contributed by the con- 
stituent subunits of TCR complexes for the peptide com- 
ponent of MHC-peptide ligands. In the absence of 
detailed structural data for the TCR; peptide/M HC tri- 
molecular complex, models such as this can provide 
important insight into the nature-of the elemental inter- 
actions that constitute TCR binding to MHC-peptide 
complexes. "Furthermore, such information should 
facilitate interpretation of the structural data for the 
trimolecular complexes when this becomes avail- 
able. 

Previous studies indicate that TCR-peptide binding 
interactions are very specific, since single amino acid 
changes had profound effects on binding (Engel and Hed- 
rick, 1 988; Danska et a/., 1990). In accordance with these 
results, the direct binding of phage-displayed TCRa 
chains to antigen appeared to be specific. Recombinant 
phage-displaying A LI TCRa chains bound to antigenic 
peptide analogues but not to mutated peptide analogues 
that failed to activate AL1 hybridoma cells (Fig. 3 and 
Fig. 4). Additionally, the cross-specificity experiment 
indicated that recombinant phage-displaying A 1. 1 TCR- 
VJCa or 3B3-TCR-VJCat bound their respective antigens 
but did not cross-react (Fig. 7). 

The failure of repeated attempts to demonstrate spec- 
ific binding of recombinant phage-displaying 5C.C7 
TCRa to cytochrome C peptide indicated that only a 
subset of TCR Va have the capacity for direct interactions 
with antigen strong enough to be detectable in this 
system. Previous studies of 5C.C7 TCR interactions with 
peptide antigen indicate that specific binding was 
mediated by CDR3 residues from both the a and & chains 
(Jorgensen et aL, 1992a, 1992b). Thus, binding to antigen 
by receptors such as 5C.C7 TCR may require the com- 
bined binding interactions of the a and chains to 
mediate strong specific binding, while antigen binding by 
receptors such as A LI and 3B3 TCR may be dominated 
by specificities mediated predominantly by the TCRa 
chain. 

The phage-display system described here extends the 
study of T cell recognition by providing a means to sim- 
plify the study of the specific interactions that constitute 
the overall receptor-ligand interactions. The power of 
the phage-display system is that recombinant phage can 
be amplified and subjected to multiple rounds of affinity 
selection as they conjain the genetic instructions for pro- 
duction of the receptor molecule. A system for selection 
and cloning of antigen-binding TCR molecules will be 
useful in exploring TCR binding to various antigens and 
should provide a means to more closely examine TCR- 
ligand interactions. 
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[57] ABSTRACT 

Filamentous phage comprising a matrix of cpvm proteins 
encapsulating a genome encoding first and second polypep- 
tides of an antogenously assembling receptor, such as an 
antibody, and a receptor comprised of the first and second 
polypeptides surface- integrated into the matrix via a cpvm 
membrane anchor domain fused to at least one of the 
polypeptides with a mutagenized CDR3 region. 

26 Claims, 12 Drawing Sheets 
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SHINE-DALGARNO MET 

G G C CG CA A ATT CT ATTTCA AGG AG ACAGT C AT A ATG 
CGTTTAAGATAAAGTTCCTCTGTCAGTATTAC 



LEADER SEQUENCE 



'AAATACCTATTGCCTACGGCAGCCGCT 
TTTATGGATAACGGATGCCGTCGGCGA 



LEADER SEQUENCE 

GGATTGTTATTACTCGCTGCCCAACCAG 1 
CCTAACAATAATGAGCGACGGGTTGGTC 



LINKER LINKER 
V H BACKBONE 



NCOI 



XHOI SPEI 



CCATGGCCCAGGTGAAACTG'CTCGAGATTTCTAGACTAGT 
GGTACCGGGTCCACTTTGACGAGCTCTAAAGATCTGATCA 



STOP LINKER 



'TyrProTyrAspValProAspTyrAlaSer , 

TACCCGTACGACGTTCCGGACTACGGTTCTTAATAGAATTCG 
ATGGGCATGCTGCAAGGCCTGATGCCAAGAATTATCTTAAGCAGCT 



FIG. 3 
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H ETEROD EMERIC RECEPTOR LIBRARIES 
USING PHAGEMIDS 

CROSS-REFERENCE TO RELATED 

APPLICATION 5 

This application is a continuation of U.S. patent applica- 
tion Ser. No. 07/826.623. filed on Jan. 27. 1992. now 
abandoned, which is a continuation-in-part of U.S. patent 
application Ser. No. 07/683.602, filed Apr. 10. 1991. now 1Q 
abandoned, the disclosures of which are hereby incorporated 
by reference. 

This invention was made with government support under 
Grant No. CA 27489 awarded by the National Institutes of 
Health. 15 

TECHNICAL FIELD 

The present invention relates to cloning vectors and 
methods for producing a library of DNA molecules capable 
of expressing a fusion protein on the surface of a filamentous 20 
phage particle. 

BACKGROUND 

Filamentous bacteriophages are a group of related viruses ^ 
that infect bacteria. They are termed filamentous because 
they are long and thin particles comprised of an elongated 
capsule that envelopes the deoxyribonucleic acid (DNA) 
that forms the bacteriophage genome. The F pili filamentous 
bacteriophage (Ff phage) infect only gram-negative bacteria M 
by specifically adsorbing to the tip of F pili. and include fd. 
flandM13. 

The mature capsule of Ff phage is comprised of a coat of 
five phage-encoded gene products: cpVTIL the major coat 
protein product of gene VIII that forms the bulk of the 35 
capsule; and four minor coat proteins. cpIH and cpIV at one 
end of the capsule and cpVTJ and cpIX at the other end of the 
capsule. The length of the capsule is formed by 2500 to 3000 
copies of cpvm in an ordered helix array that forms the 
characteristic filament structure. About five copies each of ^ 
the minor coat proteins are present at the ends of the capsule. 
The gene Hi-encoded protein (cpIH) is typically present in 
4 to 6 copies at one end of the capsule and serves as the 
receptor for binding of the phage to its bacterial host in the 
initial phase of infection. For detailed reviews of Ff phage 45 
structure, see Rasched et aL. Microbiol Rev* 50:401-427 
(1986); and Model et al.. in "The Bacteriophages, Volume 
2", R. Calendar. Ed., Plenum Press, pp. 375-456 (1988). 

The assembly of a Ff phage particle involves highly 
complex mechanics. No phage particles are assembled 50 
within a host cell; rather, they are assembled during extru- 
sion of the viral genome through the host cell's membrane. 
Prior to extrusion, the major coat protein cpVTH and the 
min or coat protein cpm are synthesized and transported to 
the host cell's membrane. Both cpvm and cpm are 55 
anchored in the host cell membrane prior to their incorpo- 
ration into the mature particle. In addition, the viral genome 
is produced and coated with cpV protein. During the extru- 
sion process. cpV-coated genomic DNA is stripped of the 
cpV coat and simultaneously recoated with the mature coat 60 
proteins. The assembly mechanisms that control transferral 
of these proteins from the membrane to the particle is not 
presently known. 

Both cpm and cpVTU proteins include two domains that 
provide signals for assembly of the mature phage particle. 65 
The first domain is a secretion signal that directs the newly 
synthesized protein to the host cell membrane. The secretion 
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signal is located at the amino terminus of the protein and 
targets the protein at least to the cell membrane. The second 
domain is a membrane anchor domain that provides signals 
for association with the host cell membrane and for asso- 
ciation with the phage particle during assembly. This second 
signal for both cpVTU and cpm comprises at least a hydro- 
phobic region for spanning the membrane. 

cpvm has been extensively studied as a model membrane 
protein because it can integrate into lipid bilayers such as the 
cell membrane in an asymmetric orientation with the acidic 
amino terminus toward the outside and the basic carboxy 
terminus toward the inside of the membrane. The mature 
protein is about 50 amino acid residues in length of which 
11 residues provide the carboxy terminus, 19 residues pro- 
vide the hydrophobic transmembrane region, and the 
remaining residues comprise the amino terminus. Consid- 
erable research has been done on the secretion signal region 
of cpvm to advance the study of membrane protein syn- 
thesis and targeting to membranes. However, tittle is known 
about the changes that are tolerated in the structure of the 
cpvm membrane anchor region mat would allow for assem- 
bly of phage particles. 

Manipulation of the sequence of cpHI shows that the 
C-terminal 23 amino acid residue stretch of hydrophobic 
amino acids normally responsible for a membrane anchor 
function can be altered in a variety of ways and retain the 
capacity to associate with membranes. However, those 
anchor-modified cpm proteins lost their ability to geneti- 
cally complement gene TH mutants indicating that the 
requirements of a membrane anchor for functional assembly 
have not been elucidated. 

Ff phage -based expression vectors have been described in 
which the entire cpm amino acid residue sequence was 
modified by insertion of short polypeptide "epitopes'* 
IParmely et aL, Gene, 73:305-318 (1988); and Cwirla et al.. 
Pmc. Natt. Acad. Sci. USA. 87:6378-6382 (1990)] or an 
amino acid residue sequence defining a single chain anti- 
body domain. McCafferty et al.. Science, 348:552-554 
(1990). These hybrid proteins were synthesized and 
assembled onto phage particles in amounts of about 5 copies 
per particle, a density at which normal cpIH is usually found. 
However, these expressed fusion proteins include the entire 
cpm amino acid residue sequence and do not suggest fusion 
proteins that utilize only the carboxy terminal membrane 
anchor domain of cpm. 

In addition, no expression system has been described in 
which a phage coat protein has been engineered to allow 
assembly of a heteromeric molecule that is functional and 
capable of incorporation into the coat of a phage particle. 

BRIEF SUMMARY OF THE INVENTION 

A new. high density, surface-integration technology has 
been discovered for expressing a recombinant gene product 
on the surface of a filamentous phage containing the recom- 
binant gene. The invention uses a filamentous phage cpvm 
membrane anchor domain as a means for linking gene- 
product and gene during the assembly stage of filamentous 
phage replication. 

That is. during filamentous phage replication, gene VHI- 
encoded proteins assemble into a matrix which encapsulates 
the phage genome. It has now been discovered that (1) phage 
assembly is not disrupted when recombinant gene VDI- 
encoded proteins are present (2) recombinant gene VIII- 
encoded proteins can be integrated into the assembling 
matrix, and (3) integration into the matrix can be directed to 
occur in a surface-accessible orientation. 



5.759.817 

3 4 

The present invention can be advantageously applied to FIG. 3 illustrates the sequence of the double-stranded 

the production of heteroraeric receptors of predetermined synthetic DNA inserted into Lambda Zap to produce a 

specificity, i.e.. it can be used to produce antibodies. T-cell Lambda Hc2 expression vector. The preparation of the 

receptors and the like that bind a preselected ligand. double -stranded synthetic DNA insert is described in 

Thus, the present invention provides for linking the func- 5 Example la(ii). The various features required for this vector 

tions of heteromeric receptor recognition and filamentous to express the V^oding DNA homologs include the Shine- 

phage replication in a method for isolating a heteromeric Dalgarno ribosome binding site, a leader sequence to direct 

receptor. The method produces a filamentous phage com- the expressed protein to the periplasm as described by 

prised of a matrix of gene Vm-encoded proteins that encap- Mouva et aL, 7. BioL Chem^ 255:27. 1980. and various 

sulate a recombinant genome. The recombinant genome 10 restriction enzyme sites used to operatively link the V w 

contains genes encoding the proteins of the heteromeric homologs to the expression vector. The V H expression 

receptor proteins. The heteromeric receptor is surface- vector sequence also contains a short nucleic acid sequence 

integrated into the encapsulating matrix via a gene VTH- that codes for amino acids typically found in variable 

encoded membrane anchor domain that is fused by a peptide regions heavy chain (V„ Backbone). This V„ Backbone is 

bond during translation to one of the heteromeric receptor 1S just upstream and in the proper reading as the V H DNA 

proteins. The heteromeric receptor and the genes which homologs that are operatively linked into the Xho I and Spe 

encode it are physically linked during the assembly stage of I cloning sites. The sequences of the top and bottom strands 

the phage replication cycle. Specifically binding the of the double-stranded synthetic DNA insert are listed 

receptor-coated phage to a solid-support advantageously respectively as SEQ. ID. NO. 1 and SEQ. E>. NO. 2. The 

provides a means for isolating a recombinant genome that ^ synthetic DNA insert is directionaUy ligated into Lambda 

encodes a desired heteromeric receptor from a diverse Zap II digested with the restriction enzymes Not 1 and Xho 

library of recombinant genomes. I to form Lambda Hc2 expression vector. 

In one embodiment the present invention contemplates FIG. 4 illustrates the major features of the bacterial 

an antibody molecule comprising heavy- and Ught-chain expression vector Lambda Hc2 (V„ expression vector). The 

proteins, said heavy-chain protein comprising a V^domain w synthetic DNA sequence from FIG. 3 is shown at the top 

flanked by an amino-terminal prokaryotic secretion signal along with the T 3 polymerase promoter from Lambda Zap JL 

domain and a carboxy-terminal filamentous phage cpVIU The orientation of the insert in Lambda Zap II is shown. The 

membrane anchor domain, said light chain protein compris- V„ DNA homologs are inserted into the Xho I and Spe I 

ing a V^-domain fused to an amino- terminal prokaryotic cloning sites. The read through transcription produces the 

secretion signal dornain. 30 decapeptide epitope (tag) that is located just 3* of the cloning 

In another embodiment, the present invention contem- s * te - 

plates a vector for expressing a fusion protein, said vector FIG. 5 illustrates the sequence of the double-stranded 

comprising a cassette that includes upstream and down- synthetic DNA inserted into Lambda Zap to produce a 

stream translatable DNA sequences operatively linked via a Lambda Lc2 expression vector. The various features 

sequence of nucleotides adapted for directional ligation of 35 required for this vector to express the V L -coding DNA 

an insert DNA, said upstream sequence encoding a prokary- homologs are described in FIG. 3. The V L -coding DNA 

otic secretion signal, said downstream sequence encoding a homologs are operatively linked into the Lc2 sequence at the 

filamentous phage gene cpVUI membrane anchor, said Sac I and Xho I restriction sites. The sequences of the top 

translatable DNA sequences operatively linked to a set of and bottom strands of the double-stranded synthetic DNA 

DNA expression signals for expression of said translatable 40 insert are listed respectively as SEQ. ID. NO. 3 and SEQ. ID. 

DNA sequences as portions of said fusion protein. NO. 4. The synthetic DNA insert is directionaUy ligated into 

Lambda Zap E digested with the restriction enzymes Sac I 

BRIEF DESCRIPTION OF THE DRAWINGS and Not J£ ^Lambda U2 expression vector. 

In the drawings forming a portion of this disclosure: pig. 6 illustrates the major features of the bacterial 

FIG. 1 illustrates a schematic diagram of the irnmunoglo- 45 expression vector Lc2 (V L expression vector). The synthetic 

bulin molecule showing the principal structural features. DNA sequence from FIG. 5 is shown at the top along with 

The circled area on the heavy chain represents the variable the T 3 polymerase promoter from Lambda Zap n. The 

region (V w ), a polypeptide containing a biologically active orientation of the insert in Lambda Zap II is shown. The 

(ligand binding) portion of that region, and a gene coding for DNA homologs are inserted into the Sac I and Xho I cloning 

that polypeptide, are produced by the methods of the present ^ sites. 

invention. FIG. 7 illustrates the didstronic expression vector. 

FIG. 2Ais a diagrammatic sketch of a heavy (H) chain of pCorab. in the form of a phagemid expression vector. To 

human IgG (IgGl subclass). Numbering is from the produce pComb, phagemids were first excised from the 

N-terminus on the left to the C-terrnimis on the right Note expression vectors. Lambda Hc2 and Lambda Lc2. using an 

the presence of four domains, each containing an ictrachain 55 i n vivo excision protocol . according to manufacturers 

disulfide bond (S — S) spanning approximately 60 amino instructions (Stratagene. La Jolla, Calif.). The pComb 

acid residues. The symbol CHO stands for carbohydrate. expression vector is prepared from Lambda Hc2 and 

The V region of the heavy (H) chain (V„) resembles V L in Lambda Lc2 which do not contain V^-coding or V^-coding 

having three hypervariable CDR (not shown). DNA homologs. The in vivo excision protocol moved the 

FIG. 2B-1 is a diagrammatic sketch of a human light 60 cloned insert from the Lambda Hc2 and Lc2 vectors into a 

(Kappa) chain (Panel 1). Numbering is from the N-terminus phagemid vector. The resultant phagemids contained the 

on the left to the C-terrninus on the right. Note the intra chain same nucleotide sequences for antibody fragment cloning 
disulfide bond (S — S) spanning about the same number of and expression as did the parent vectors. Hc2 and Lc2 

amino acid residues in the V L and C^, domains. phagemid expression vectors, were separately restriction 

FIG. 2B-2 shows the locations of the complementarity- 65 digested with Sea I and EcoR L The linearized phagemids 

determining regions (CDR) in the domain. Segments were ligated via the Sea I and EcoR I cohesive termini to 

outside the CDR are the framework segments (FR). form the dicistronic (combinatorial) vector, pComb. 
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FIG. 8 illustrates a schematic diagram of the composition 
of pCBAK8-2b phagemid vector, the pathway for Fab 
assembly and incorporation in phage coaL The vector carries 
the chloramphenicol acetyl transferase (CAT) marker gene 
in addition to the nucleotide residue sequences encoding the 
Fd-cpVUI fusion protein and the kappa chain. The f 1 phage 
origin of replication facilitates the generation of single 
stranded phagemid. The isopropyl thiogalactopyranoside 
(TPTG) induced expression of a dicistronic message encod- 
ing the Fd-cpVm fusion (V w . C W1 . cpVTH) and the light 
chain (V L . CJ leads to the formation of heavy and light 
chains. Each chain is delivered to the periplasmic space by 
the pelB target sequence, which is subsequently cleaved. 
The heavy chain is anchored in the membrane by cpVTH 
fusion while the light chain is secreted into the periplasm. 15 
The heavy chain in the presence of light chain assembles to 
form Fab molecules. The Fabs are incorporated into phage 
particles via cpVm (black dots). 

FIG. 9 illustrates the electron rnicrographic localization of 
5-7 am colloidal gold particles coated with NPN-BSA 20 
conjugate along the surface of filamentous phage, and from 
phage emerging from a bacterial cell. Panel 9A shows 
filamentous phage emerging from the surface of the bacterial 
cell specifically labelled with the colloidal gold particles 
coated with BSA-NPN antigen. Panel 9B shows a portion of 25 
a mature filamentous phage on the length of which is 
exhibited the labelling of antigen binding sites. 

FIG. 10 illustrates the results of a two-site EUSA for 
assaying for the presence and function of Fab antibody 
attached to the surface of bacteriophage particles as 
described in Example 4b. For expression of Fab antibody on 
phage surfaces. XL I -Blue cells were transformed with the 
phagemid expression vector. pCBAK8-2b. The inducer, 
isopropyl thiogalactopyranoside (IFTG). was admixed with 
the bacterial suspension at a final concentration of 1 niM for 
one hour. Helper phage was then admixed with the bacterial 
suspension to initiate the generation of copies of the sense 
strand of the phagemid DNA. After a two hour maintenance 
period, bacterial supematants containing bacteriophage par- 
ticles were collected for assaying in EUSA. 

Specific titratable binding of NPN-Fab-expressing bacte- 
riophage particles to NPN-coated plates was exhibited. No 
binding was detected with helper phage alone. 

FIG. 11 illustrates the inhibition of NPN-Fab expressing 45 
bacteriophage to NPN antigen- coated plates with the addi- 
tion of increasing amounts of free hapten. The assays were 
performed as described in FIG. 10. Complete inhibition of 
binding was observed with 5 ng of added free NPN hapten. 

FIG. 12 illustrates schematically the process of 50 
mutagenizing the CDR3 region of a heavy chain fragment 
resulting in an alteration of binding specificity. The oligo- 
nucleotide primers are indicated by black bars. The process 
is described in Example 6. 

55 

DETAILED DESCRIPTION OF THE 
INVENTION 



COOH refers to the free carboxy group present at the 
carboxy terminus of a polypeptide. In keeping with standard 
polypeptide nomenclature (described in J. BioL Chenu, 
2433552-59 (1969) and adopted at 37 C-F.R. 1.822(bX2)). 
abbreviations for amino acid residues are shown in the 
following Table of Correspondence: 

TABLE OF CORRESPONDENCE 

SYMBOL 



30 



35 



40 



A. Definitions 



60 



Amino Acid Residue: An amino acid formed upon chemi- 
cal digestion (hydrolysis) of a polypeptide at its peptide 
linkages. The amino acid residues described herein are 
preferably in the lf L w isomeric form. However, residues in 
the "D w isomeric form can be substituted for any I^amino 
acid residue, as long as the desired functional property is 65 
retained by the polypeptide. NH 2 refers to the free amino 
group present at the amino terminus of a polypeptide. 



1 -Letter 


3-Letter 


AMINO ACID 


Y 


Tyr 


tyrosine 


G 


Gly 


glycine 


F 


Fbe 


phenylalanine 


M 


Met 


methionine 


A 


Ala 


alanine 


S 


Ser 


serine 


I 


Be 


isoleucine 


L 


Leu 


leucine 


T 


Thr 


threonine 


V 


Val 


valine 


P 


Pro 


proline 


K 


Lys 


lysine 


H 


His 


histkfine 


Q 


Gin 


ghxtamxne 


E 


Glu 


glutamic acid 


2 


Gtx 


Ghi and/or Gtn 


w 


Tip 


tryptophan 


R 


Arg 


argffiine 


D 


Asp 


aspartic acid 


N 


Asa 


asparegine 


B 


Asx 


Asa and/or Asp 


C 


Cys 


cysteine 


J 


Xaa 


Unknown or other 



It should be noted that all amino acid residue sequences 
represented herein by formulae have a left-to-right orienta- 
tion in the conventional direction of amino terminus to 
carboxy terminus. In addition, the phrase "amino acid resi- 
due" is broadly defined to include the amino acids listed in 
the Table of Correspondence and modified and unusual 
amino acids, such as those listed in 37 C.RR. 1.822(b)(4), 
and incorporated herein by reference. Furthermore, it should 
be noted mat a dash at the beginning or end of an amino acid 
residue sequence indicates a peptide bond to a further 
sequence of one or more amino add residues or a covalent 
bond to an amines terminal group such as NH 2 or acetyl or 
to a carboxy-terminal group such as COOH. 

Nucleotide: A monomeric unit of DNA or RNA consisting 
of a sugar moiety (pentose), a phosphate, and a nitrogenous 
heterocyclic base. The base is linked to the sugar moiety via 
the glycosidic carbon (V carbon of the pentose) and that 
combination of base and sugar is a nucleoside. When the 
nucleoside contains a phosphate group bonded to the 3* or 5' 
position of the pentose it is referred to as a nucleotide. A 
sequence of operatively linked nucleotides is typically 
referred to herein as a "base sequence" or "nucleotide 
sequence**, and their grammatical equivalents, and is repre- 
sented herein by a formula whose left to right orientation is 
in the conventional direction of 5'-terminus to 3-tcrminus. 

Base Pair (bp): A partnership of adenine (A) with mymine 
(T), or of cytosine (C) with guanine (G) in a double stranded 
DNA molecule. In RNA. uracil (U) is substituted for thym- 
ine. 

Nucleic Acid: A polymer of nucleotides, either single or 
double stranded. 

Polynucleotide: a polymer of single or double stranded 
nucleotides. As used herein ''polynucleotide'* and its gram- 
matical equivalents will include the full range of nucleic 
acids. A polynucleotide will typically refer to a nucleic acid 
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molecule comprised of a linear strand of two or more 
deoxyribonucleotides and/or ribonucleotides. The exact size 
will depend on many factors, which in turn depends on the 
ultimate conditions of use, as is well known in the art. The 
polynucleotides of the present invention include primers, 
probes. RNA/DNA segments, oligonucleotides or "oligos" 
(relatively short polynucleotides), genes, vectors, plasmids, 
and the like. 

Gene: A nucleic acid whose nucleotide sequence codes for 
an RNA or polypeptide. A gene can be either RNA or DNA. 

Duplex DNA: a double-stranded nucleic acid molecule 
comprising two strands of substantially complementary 
polynucleotides held together by one or more hydrogen 
bonds between each of the complementary bases present in 
a base pair of the duplex. Because the nucleotides that form 
a base pair can be either a ribonucleotide base or a deox- 
yribonucieotide base, the phrase "duplex DNA" refers to 
either a DNA-DNA duplex comprising two DNA strands (ds 
DNA). or an RNA-DNA duplex comprising one DNA and 
one RNA strand. 

Complementary Bases: Nucleotides mat normally pair up 
when DNA or RNA adopts a double stranded configuration. 

Complementary Nucleotide Sequence: A sequence of 
nucleotides in a single-stranded molecule of DNA or RNA 
that is sufficiently complementary to that on another single 
strand to specifically hybridize to it with consequent hydro- 
gen bonding. 

Conserved: A nucleotide sequence is conserved with 
respect to a preselected (reference) sequence if it non- 
randomly hybridizes to an exact complement of the prese- 
lected sequence. 

Hybridization: The pairing of substantially complemen- 
tary nucleotide sequences (strands of nucleic acid) to form 
a duplex or heteroduplex by the establishment of hydrogen 
bonds between complementary base pairs. It is a specific, ie. 
non-random, interaction between two complementary poly- 
nucleotides that can be competitively inhibited. 

Nucleotide Analog: A purine or pyrimidine nucleotide 
that differs structurally from A. T. G. C. or U, but is 
sufficiently similar to substitute for the normal nucleotide in 
a nucleic acid molecule. 

DNA Homolog: Is a nucleic acid having a preselected 
conserved nucleotide sequence and a sequence coding for a 
receptor capable of binding a preselected ligand. 

Recombinant DNA(rDNA) molecule: a DNA molecule 
produced by operatively linking two DNA segments. Thus, 
a recombinant DNA molecule is a hybrid DNA molecule 
comprising at least two nucleotide sequences not normally 
found together in nature. rDNA's not having a common 
biological origin, i.e.. evolutionarily different, are said to be 
"heterologous'*. 

Vector: a rDNA molecule capable of autonomous repli- 
cation in a cell and to which a DNA segment, e.g.. gene or 
polynucleotide, can be operatively linked so as to bring 
about replication of the attached segment Vectors capable of 
directing the expression of genes encoding for one or more 
proteins are referred to herein as "expression vectors". 
Particularly important vectors allow cloning of cDNA 
(complementary DNA) from mRNAs produced using 
reverse transcriptase. 

Receptor A receptor is a molecule, such as a protein, 
glycoprotein and the like, that can specifically (non- 
randomly) bind to another molecule. 

Antibody: The term antibody in its various grammatical 
forms is used herein to refer to immunoglobulin molecules 
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and immunologically active portions of immunoglobulin 
molecules. Le.. molecules lhat contain an antibody combin- 
ing site or paratope. Exemplary antibody molecules are 
intact immunoglobulin molecules, substantially intact 

5 immunoglobulin molecules and portions of an immunoglo- 
bulin molecule, including those portions known in the art as 
Fab. Fab'. F(ab , ) 2 and F(v). 

Antibody Combining Site: An antibody combining site is 
that structural portion of an antibody molecule comprised of 

1° a heavy and light chain variable and hypervariablc regions 
that specifically binds (immunoreacts with) an antigen. The 
term immunoreact in its various forms means specific bind- 
ing between an antigenic deterrninant-containing molecule 
and a molecule containing an antibody combining site such 

15 as a whole antibody molecule or a portion thereof. 

Monoclonal Antibody: The phrase monoclonal antibody 
in its various grammatical forms refers to a population of 
antibody molecules that contains only one species of anti- 
body combining site capable of immunoreacting with a 

20 particular antigen. A monoclonal antibody thus typically 
displays a single binding affinity for any antigen with which 
it immunoreacts. A monoclonal antibody may therefore 
contain an antibody molecule having a plurality of antibody 
combining sites, each immunospecific for a different 

23 antigen, e.g.. a bispecific monoclonal antibody. 

Fusion Protein: A protein comprised of- at least two 
polypeptides and a linking sequence to operatively link the 
two polypeptides into one continuous polypeptide. The two 

30 polypeptides linked in a fusion protein are typically derived 
from two independent sources, and therefore a fusion pro- 
tein comprises two linked polypeptides not normally found 
linked in nature. 

Upstream: In the direction opposite to the direction of 

35 DNA transcription, and therefore going from 5' to 3' on the 
non-coding strand, or 3' to 5* on the mRNA. 

Downstream: Further along a DNA sequence in the direc- 
tion of sequence transcription or read out that is traveling in 
a J- to S'-direction along the non-coding strand of the DNA 

40 or 5'- to 3 -direction along the RNA transcript. 

Cistron: Sequence of nucleotides in a DNA molecule 
coding for an amino acid residue sequence and including 
upstream and downstream DNA expression control ele- 
ments. 

43 Stop Codon: Any of three codons thai do not code for an 
amino acid, but instead cause termination of protein syn- 
thesis. They are UAG, UAA and UGA and are also referred 
to as a nonsense or termination codon. 

50 Leader Polypeptide: A short length of amino acid 
sequence at the amino end of a protein, which carries or 
directs the protein through the inner membrane and so 
ensures its eventual secretion into the periplasmic space and 
perhaps beyond. The leader sequence peptide is commonly 

55 removed before the protein becomes active. 

Reading Frame: Particular sequence of contiguous nucle- 
otide triplets (codons) employed in translation. The reading 
frame depends on the location of the translation initiation 
codon. 

60 

B. Filamentous Phage 

The present invention contemplates a filamentous phage 
comprising a matrix of cpVTH proteins encapsulating a 
genome encoding first and second polypeptides. The phage 
65 further comprises a heteromeric receptor comprised of the 
first and second polypeptides surface-integrated into the 
matrix via a cpVTTJ membrane anchor domain fused to at 
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least odc of the first or second polypeptides. Preferably, the 
first and second polypeptides are W H and V L proteins, 
respectively. 

The first and second polypeptides are capable of autog- 
enous assembly into a functional receptor, which is 
expressed on the outer surface in a manner accessible to 
ligand. i.e. they are surface-integrated into the phage. 
Typically, the receptor is comprised of a linking polypeptide 
that contains the cpVUI membrane anchor domain, such as 
a polypeptide described in Section C. and a non-linking 
polypeptides). 

Because the receptor is linked to the phage in a surface 
accessible manner, the phage can be advantageously used as 
a solid-phase affinity sorbent In preferred embodiments, the 
phage are linked, preferably removably linked, to a solid 
(aqueous insoluble) matrix such as agarose, cellulose, syn- 
thetic reins, polysaccharides and the like. For example, 
transformants shedding the phage can be applied to and 
retained in a column and maintained under conditions mat 
support shedding of the phage. An aqueous composition 
containing a ligand that binds to the receptor expressed by 
the phage is then passed through the column at a predeter- 
mined rate and under receptor-binding conditions to form a 
solid-phase receptor-ligand complex. The column is then 
washed to remove unbound materiaL leaving the ligand 
bound to the solid-phase phage. The ligand can then be 
removed and recovered by washing the column with a buffer 
that promotes dissociation of the receptor-ligand complex. 

Alternatively, purified phage can be admixed with a 
aqueous solution containing the ligand to be affinity purified. 
The receptor/ligand binding reaction admixture thus formed 
is maintained for a time period and under binding conditions 
sufficient for a phage-linked receptor-ligand complex to 
form. The phage-bound ligand (ligand-bearing phage) are 
then separated and recovered from the unbound materials, 
such as by centrifugation. electrophoresis, precipitation, and 
the like. 

C. DNA Expression Vectors 

A vector of the present invention is a recombinant DNA 
(rDNA) molecule adapted for receiving and expressing 
translatable DNA sequences in the form of a fusion protein 
containing a filamentous phage gene VTH membrane anchor 
domain and a prokaryotic secretion signal domain. The 
vector comprises a cassette that includes upstream and 
downstream translatable DNA sequences operatively linked 
via a sequence of nucleotides adapted for directional liga- 
tion. The upstream translatable sequence encodes the secre- 
tion signal. The downstream translatable sequence encodes 
the filamentous phage membrane anchor. The cassette pref- 
erably includes DNA expression control sequences for 
expressing the fusion protein that is produced when a 
translatable DNA sequence is directionaUy inserted into the 
cassette via the sequence of nucleotides adapted for direc- 
tional ligation. 

An expression vector is characterized as being capable of 
expressing, in a compatible hose a structural gene product 
such as a fusion protein of the present invention. 

As used herein, the term 'Vector" refers to a nucleic acid 
molecule capable of transporting between different genetic 
environments another nucleic acid to which it has been 
operatively linked. Preferred vectors are those capable of 
autonomous replication and expression of structural gene 
products present in the DNA segments to which they are 
operatively linked. 

As used herein with regard to DNA sequences or 
segments, the phrase "operatively linked** means the 
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sequences or segments have been covalentiy joined into one 
shard of DNA. whether in single or double stranded form. 

The choice of vector to which a cassette of this invention 
is operatively linked depends directly, as is well known in 

5 the art. on the functional properties desired. e.g.. vector 
replication and protein expression, and the host cell to be 
transformed, these being limitations inherent to the art of 
constructing recombinant DNA molecules. 
In preferred embodiments, the vector utilized includes a 

10 prokaryotic replicon Le.. a DNA sequence having the ability 
to direct autonomous rerMicatioo and maintenance of the 
recombinant DNA molecule extra chromosomally in a 
prokaryotic host cell, such as a bacterial host cell, trans- 
formed therewith. Such replicons are well known in the art 

13 In addition, those embodiments that include a prokaryotic 
replicon also include a gene whose expression- confers a 
selective advantage, such as drug resistance, to a bacterial 
host transformed therewith. Typical bacterial drug resistance 
genes are those that confer resistance to ampicillin or 

20 tetracycline. Vectors typically also contain convenient 
restriction sites for insertion of translatable DNA sequences. 
Exemplary vectors are the plasmids pUC8. pUC9, pBR322. 
and pBR329 available from BioRad Laboratories. 
(Richmond. Calif.) and pPL and pKK223 available from 

25 Pharmacia. (Piscataway. N.J.).' 

A sequence of nucleotides adapted for directional ligation, 
i.e., a polylinker. is a region of the DNA expression vector 
that (1) operatively links for replication and transport the 
upstream and downstream translatable DNA sequences and 

30 (2) provides a site or means for directional ligation of a DNA 
sequence into the vector. Typically, a directional polylinker 
is a sequence of nucleotides that defines two or more 
restriction endonuclease recognition sequences, or restric- 
tion sites. Upon restriction cleavage, the two sites yield 

35 cohesive termini to which a translatable DNA sequence can 
be ligated to the DNA expression vector. Preferably, the two 
restriction sites provide, upon restriction cleavage, cohesive 
termini that are n on -complementary and thereby permit 
directional insertion of a translatable DNA sequence into the 

40 cassette. In one embodiment, the directional ligation means 
is provided by nucleotides present in the upstream translat- 
able DNA sequence, downstream translatable DNA 
sequence, or both. In another embodiment the sequence of 
nucleotides adapted for directional ligation comprises a 

45 sequence of nucleotides that defines multiple directional 
cloning means. Where the sequence of nucleotides adapted 
for directional ligation defines numerous restriction sites, it 
is referred to as a multiple cloning site. 

^ A translatable DNA sequence is a linear series of nucle- 
otides that provide an uninterrupted series of at least 8 
codons that encode a polypeptide in one reading frame. 

An upstream translatable DNA sequence encodes a 
prokaryotic secretion signal. The secretion signal is a leader 

55 peptide domain of protein that targets the protein to the 
periplasmic membrane of gram negative bacteria, 

A preferred secretion signal is a pelB secretion signal. The 
predicted amino acid residue sequences of the secretion 
signal domain from two pelB gene product variants from 

60 Erwinia carotova are shown in Table 1 as described by Lei. 
et aL. Nature, 331:543-546 (1988). A particularly preferred 
pelB secretion signal is also shown in Table 1. 

The leader sequence of the pelB protein has previously 
been used as a secretion signal for fusion proteins. Better et 

65 aL. Science, 240:1041-1043 (1988); Sastry et al., Proc. NaxL 
Acad ScL USA, 86:5728-5732 (1989); and MuUinax et al., 
Proc. NatL Acad. ScL USA 87:8095-8099 (1990). 
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Amino acid residue sequences for other secretion signal 
polypeptide domains fromE. coli useful in this invention are 
also listed in Table 1. Oliver, In Neidhard. F. C. (ed.). 
Escherichia coli and Salmonella Typhimurium. American 
Society for Microbiology. Washington. D.C.. 1:56-69 
. (1987). 

A translatable DNA sequence encoding the pelB secretion 
signal having the amino acid residue sequence shown in 
SEQ. ID. NO. 5 is a preferred DNA sequence for inclusion 
in a DNA expression vector of 

TABLE 1 



Leader Sequences 

SEQ. 

ED. NO. Type, Amino Acid Residue Sequence 

(5) PelB 1 MetLysTytLeuLcuProThrAlaAUAlaGlyLeuLeu 

LcuLeuAUAlaGlnProAJaMet 

(6) PelB 2 MctLysiyd^ulxiJ^ThrAlaAlaAlaGljdxuLeu 

IxuLcuAIaAlaGlnProAl^uiPtoAiaMetAla 

(7) PcIB 3 MetLysSedxuIkTniProBeAlaAJaGlyLeuLeu 

LeuAlaPheScrGlnTyrSciLeuAla 

(8) MaflB 4 MetLysDeLysiTwGtyAlaArxD^uAlaLeuScT 

AhUuThrTVMetMetPheScrAlaSerAlaLcuAla 
Lysfle 

(9) OmpF* MetMctLyiArgAsnDcLcuAiaValDeVaiProAla 

UuLeuValAlaGl/ThiAJaAsiiAlaAlaGlu 

(10) PboA 4 Meiy^Jli^ThrlkAlal^uAlal^ruLcuProLcu 

LeuPteTbjPToVarrhrLysAlaArg'nir 

(11) Bla* MetSerlk^IlnHisPbeArgValAlal^uIkProPbe 

PteAlaAlaPheCysLeuPloValPbcAJaHisPio 

( 1 2) LamB 4 MciMetHeThil<cuArgLy sLeuProLeuAlaValAla * 

\^LAiaAJaGJyVaIMctScrAlaGlnAlaMctAlaVal 
Asp 

(13) Lpp* MctLyvUaThiLyaLCTiValLeuGtyAJaValDcLeu 

GfyScrThrLcuLcuAlaGlyCysSer 

(14) cpVm 5 X^ysLysSeiLeuVULcuLysAlaSerValAJaVal 

AlaThiLeuViJPioMelLeuSerPheAla 
( 13) cpm 6 MetLysLysLeuLeuPheAlalkPio^ 
PheiyrSeTHisSer 

'pdB used in this invention 

2 pdB from Erwinia caroKrvora gene 

3 pdB from Erwinia camkrvora EC 16 gene 

'leader sequences from E coli 

'leader sequence for cpVm 

*]cader sequence for cpIH 

this invention. 

A downstream translatable DNA sequence encodes a 
filamentous phage membrane anchor. Preferred membrane 
anchors are obtainable from filamentous phage M13. f 1. fd. 
and the like. Preferred membrane anchor domains are found 
in the coat proteins encoded by gene ID and gene VUL Thus, 
a downstream translatable DNA sequence encodes an amino 
acid residue sequence mat corresponds, and preferably is 
identical, to the membrane anchor domain of either a fila- 
mentous phage gene TH or gene VUJ coat protein. 

The membrane anchor domain of a filamentous phage 
coat protein is the carboxy terminal region of the coat 
protein and includes a region of hydrophobic amino acid 
residues for spanning a lipid bilayer membrane, and a region 
of charged amino acid residues normally found at the 
cytoplasmic face of the membrane and extending away from 
the membrane. In the phage fl. gene VHI coat protein's 
membrane spanning region comprises residue Trp-26 
through Lys-40. and the cytoplasmic region comprises the 
carboxy- terminal 11 residues from 41 to 52. Ohkawa et al.. 
/ Biol. Chem.. 256:9951-9958 (1981). 

The amino acid residue sequence of a preferred mem- 
brane anchor domain derived from the M13 filamentous 
phage gene III coat protein (also designated cpUT) has a 
sequence shown in SEQ. ID. NO. 16 from residue 1 to 
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residue 211. Gene HI coat protein is present on a mature 
filamentous phage at one end of the phage particle with 
typically about 4 to 6 copies of the coat protein. 
The amino acid residue sequence of a preferred mem- 

5 brane anchor domain derived from the M13 filamentous 
phage gene VIII coat protein (also designated cpVIH) has a 
sequence shown in SEQ. ID. NO. 17 from residue i to 
residue 50. Gene VTH coat protein is present on a mature 
filamentous phage over the majority of the phage particle 
with typically about 2500 to 3000 copies of the coat protein. 

1 For detailed descriptions of the structure of filamentous 
phage particles, their coat proteins and particle assembly, see 
the reviews by Rached et al.. Microbiol. Rev^ 50:401-427 
(1986); and Model et aL. in 'The Bacteriophages: Vol. 2\R. 
Calendar, ed. Plenum Publishing Co.. pp. 375-456. (1988). 

15 A cassette in a DNA expression vector of this invention is 
the region of the vector that forms, upon insertion of a 
translatable DNA sequence, a sequence of nucleotides 
capable of expressing, in an appropriate host a fusion 
protein of this invention. The expression- competent 

20 sequence of nucleotides is referred to as a cistron. Thus, the 
cassette comprises DNA expression control elements opera- 
tively linked to the upstream and downstream translatable 
DNA sequences. A cistron is formed when a translatable 
DNA sequence is directionally inserted (directionally 

25 ligated) between the upstream and downstream sequences 
via the sequence of nucleotides adapted for that purpose. 
The resulting three translatable DNA sequences, namely the 
upstream, the inserted and the downstream sequences, are all 
operatively linked in the same reading frame. 

DNA expression control sequences comprise a set of 

30 DNA expression signals for expressing a structural gene 
product and include both 5* and 3' elements, as is well 
known, operatively linked to the cistron such that the cistron 
is able to express a structural gene product. The 5' control 
sequences define a promoter for initiating transcription and 

35 a ribosome binding site operatively linked at the 5' terminus 
of the upstream translatable DNA sequence. 

To achieve high levels of gene expression in E. coli, it is 
necessary to use not only strong promoters to generate large 
quantities of mRNA. but also ribosome binding sites to 

40 ensure that the mRNA is efficiently translated. In E. coli. the 
ribosome binding site includes an initiation codon (AUG) 
and a sequence 3-9 nucleotides long located 3-11 nucle- 
otides upstream from the initiation codon (Shine et aL. 
Nature, 25434 (1975)] The sequence, AGGAGGU. which is 

45 called the Shine- Dalgamo (SD) sequence, is complementary 
to the 3' end of £. coli 16S mRNA. Binding of the ribosome 
to mRNA and the sequence at the 3' end of the mRNA can 
be affected by several factors: 

(i) The degree of complementarity between the SD 
50 sequence and 3' end of the 16S tRNA. 

(ii) The spacing and possibly the DNA sequence lying 
between the SD sequence and the AUG [Roberts et al., 
Proc. Natl Acad. ScL USA, 76:760 (1979a); Roberts et 
al.. Proc. Natl. Acad ScL USA, 76:5596 (1979b); 

55 Guarente et al.. Science, 209:1428 (1980); and 
Guarente et aL. Cell, 20:543 (1980).] Conization is 
achieved by measuring the level of expression of genes 
in plasmids in which this spacing is systematically 
altered. Comparison of different mRNAs shows that 

60 there are statistically preferred sequences from posi- 
tions -20 to +13 (where the A of the AUG is position 
0) [Gold et al.. Annu. Rev. Microbiol, 35:365 (1981)]. 
Leader sequences have been shown to influence trans- 
lation drarnatically (Roberts et al.. 1979 a, b supra). 

65 (iii) The nucleotide sequence following the AUG. which 
affects ribosome binding [Taniguchi et aL. J. MoL Biol^ 
118:533 (1978)]. 
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Useful ribosome binding sites arc shown in Tabic 2 below. replication. Particularly preferred is a filamentous phage 

origin of replication having a sequence shown in SEQ. ID. 
TABLE 2 N0 * 117 30(1 described by Short et al.. NucL Acids Res^ 

. — 16:7583-7600 (1988). Preferred DNA expression vectors 

Ribosome p ^rfin^ sites* 5 aj C the dicistronic expression vectors pCOMB8 and 

pCOMB3 described in Example lb(i) and lb(ii). respec- 
tively. 



SEQ. 
ID. NO. 



1. (18) 5' aaucirj ggaGG CUUUUU Uau^ ^cguucu D. Polypeptides 

1 (19) 5' UAACUAAGGAUGAAAUGC'AUjJUCUAAGACA ... . 

3. (20) 5' UCC UAGGAGGtA AJGACCUATGCGAGCUUUU 10 In another embodiment, the present invention comem- 

4! (21) 5" AUGUAC UAAGOAC3GU UG UXUG GAACAACGC plates a polypeptide comprising an insert domain flanked by 

" — " : . . . , . T~ an ammo-terminal secretion signal domain and a carboxy- 

^S^ f ^^i rc8i0nS ^ SyGt ^ m terminal filamentous phage coat protein membrane anchor 

AUG = initiation codon (double underlined) domain. 

1. = Phage $xi74 geoc-A protein 15 preferably, the polypeptide is a fusion protein having a 

^SgeK^proteta receptor domain comprised' of an amino acid residue 

4. = Phage lambda gene-oo protein sequence that defines the ligand binding domain of a recep- 

_ . . , „ - .. tor protein positioned between a prokaryotic secretion signal 

The 3' control sequences define at least one term.nation ^ f vm-encoded (cpVHI) membrane anchor 

(stop) codon in and l^ely Unked to the M ^ embodiments^ receptor protein isa 

d °S^S!^^i^lM^^ polypcptidec^ofahe,^^ 

a system for cloning translatable DNA sequences into the polypeptide has a receptor domain. >t is preferred to 

cassette portion of the vector to produce a cistron capable of herein as a receptor. In other preferred ernbodiments the 

expressing a fusion protein of this invention. secretion signal domain is a pelB secretion signal as 

In preferred embodiments, a DNA expression vector described herein, 

provides a system for independently cloning two translatable Preferred heterodimeric receptors include 

DNA sequences into two separate cassettes present in the immunoglobulins, major histocompatibility antigens of 

vector, to form two separate tistrons for expressing both j or IL lymphocyte receptors, in te grins and the like 

polypeptides of a heterodimeric receptor, or the ligand heterodimeric receptors. 

binding portions of the polypeptides that comprise : a i net- » ^ one embodiment, a polypeptide of this invention has an 
erodimeric receptor. The : DN A anL acid residue sequence** can be represented by the 

express^gtwocistronsisreferredtoasadiastronicexpres- ^ ^ ^ of , 0 carboxy 

sion vector. 

Thus, a preferred DNA expression vector of this invention terminus: 

comprises, in addition to the cassette previously described in 35 m ^o-^tn,.— v-<X)u— Z—COOH, (Fi) 

detail, a second cassette for expressing a second fusion 2 

protein. The second cassette includes a third translatable where O represents an amino acid residue sequence defining 

DNA sequence that encodes a secretion signal, as defined a secretion signal. U represents a first spacer polypeptide. V 

herein before, operatively linked at its 3' terminus via a represents an amino acid residue sequence defining a recep- 

sequence of nucleotides adapted for directional ligation to a ^ tor ^main, X represents a second spacer polypeptide, and Z 

downstream DNA sequence defining at least one stop codon. represents an amino acid residue sequence defining a fila- 

The third translatable DNA sequence is operatively linked at memous phage coat protein membrane anchor, with the 

its 5* terminus to DNA expression control sequences forming p^y^ ^ m is fo c integer 0 or 1 such that when m is O. 

the 5' elements defined above. The second cassette is y . $ ^ ^ afld whefl m is L tj is present and n is 0 

capable, upon insertion of a translatable DNA sequence, of 45 ^ { when n ^ q. X is not present and when n is 

expressing the second fusion protein. 1 Y ic nrr^nt 

Ip a preferred embodiment a DNA expression v^tor is £ ^ me ^ „ d Ae to 

designed foe convenient manipulation in the form 01 a v ' . . 0 , a~k-~a 

^^^r^t^kc^Aing to the teachings of the tous phage coat protein membrane ^ncborare as defined 

Sn^eS a DNA expression herein above. Particularly preferred is a polypepUde accord- 

filamentous phage origin of replication such that the vector. anchor as described herein. In another preferred embodi- 

upon presentation of the appropriate genetic ment the secretion signal is the pelB secreDon signal, 

complementation, can replicate as a filamentous phage in In one embodiment. V is an amino acid residue sequence 

single stranded replicative form and be packaged into fila- that defines the ligand binding domain of a chain of a 

mentous phage particles. This feature provides the ability of 55 heterodimeric receptor molecule, and preferably is an lmrau- 

the DNA expression vector to be packaged into phage noglobuiin variable region polypeptide. In a particularly 

particles for subsequent segregation of the particle, and preferred polypeptide V is a V„ or V L polypeptide. Most 

vector contained therein, away from other particles that preferred is a polypeptide where V is an irnmunoglobulin V H 

comprise a population of phage particles. polypeptide, and m and n arc both zero. 

A filamentous phage origin of replication is a region of the «o In another embodiment, U or X can define a proteolytic 

phage genome, as is well known, that defines sites for cleavage site, such as the sequence of amino acids found in 

initiation of replication, termination of replication and pack- a precursor protein, such as prothrombin, factor X and the 

aging of the replicative form produced by replication. See. like, that defines the site of cleavage of the protein. A fusion 

for example. Rasched et al, Microbiol /?*u, 50:401^27 protein having a cleavage site provides a means to purify the 

(1986); and Horiuchi. J, Mol Biol, 188:215-223 (1986). 65 protein away from the phage particle to which it is attached. 

A preferred filamentous phage origin of replication for use The polypeptide spacers U and X can each have any 

in the present invention is a M 13. f 1 or fd phage origin of sequence of amino acid residues of from about 1 to 6 amino 
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acid residues in length. Typically the spacer residues are 
present in a polypeptide to accommodate the continuous 
reading frame that is required when a polypeptide is pro- 
duced by the methods disclosed herein using a DNA expres- 
sion vector of this invention. 5 

A receptor of the present invention assumes a conforma- 
tion having a binding site specific for. as evidenced by its 
ability to be competitively inhibited, a preselected or pre- 
determined ligand such as an antigen, enzymatic substrate 
and the like. In one embodiment, a receptor of this invention 10 
is a ligand binding polypeptide that forms an antigen binding 
site which specifically binds to a preselected antigen to form 
a complex having a sufficiently strong binding between the 
antigen and the binding site for the complex to be isolated. 
When the receptor is an antigen binding polypeptide its 15 
affinity or avidity is generally greater than 10 3 M" 1 more 
usually greater than 10 6 and preferably greater man 10 s M" 1 . 

In another embodiment a receptor of the subject inven- 
tion binds a substrate and catalyzes the formation of a 
product from the substrate. While the topology of the ligand 20 
binding site of a catalytic receptor is probably more impor- 
tant for its preselected activity than its affinity (association 
constant or pKa) for the substrate, the subject catalytic 
receptors have an association constant for the preselected 
substrate generally greater than 10 3 M" 1 , more usually 25 
greater than 10 5 M" 1 or 10 6 M" 1 and preferably greater than 
10 7 M" 1 . 

Preferably the receptor produced by the subject invention 
is heterodimeric and is therefore normally comprised of two 
different polypeptide chains, which together assume a coo- 30 
formation having a binding affinity, or association constant 
for the preselected ligand that is different preferably higher, 
than the affinity or association constant of either of the 
polypeptides alone, i.e.. as monomers. One or both of the 
different polypeptide chains is derived from the variable 35 
region of the light and heavy chains of an immunoglobulin. 
Typically, polypeptides comprising the ligjit (VJ and heavy 
(V w ) variable regions are employed together for binding the 
preselected ligand. 

A receptor produced by the subject invention can be 40 
active in monomelic as well as multimeric forms, either 
homomeric or heteromeric. preferably heterodimeric. For 
example. V H and ligand binding polypeptide produced 
by the present invention can be advantageously combined in 
the heterodimer to modulate the activity of either or to 45 
produce an activity unique to the heterodimer. 

The individual ligand polypeptides will be referred to as 
V w and V t and the heterodimer will be referred to as a Fv. 
However, it should be understood that a V„ may contain in 
addition to die V w . substantially all or a portion of the heavy 50 
chain constant region. Similarly, a may contain, in 
addition to the V t , substantially all or a portion of the light 
chain constant region. A heterodimer comprised of a V H 
containing a portion of the heavy chain constant region and 
a W L containing substantially all of the light chain constant 55 
region is termed a Fab fragment. The production of Fab can 
be advantageous in some situations because the additional 
constant region sequences contained in a Fab as compared to 
a Fv can stabilize the V H and interaction. Such stabili- 
zation can cause the Fab to have higher affinity for antigen. 60 
In addition the Fab is more commonly used in the art and 
thus there are more commercial antibodies available to 
specifically recognize a Fab in screening procedures. 

The individual V„ and V L polypeptides can be produced 
in lengths equal to or substantially equal to their naturally 65 
occurring lengths. However, in preferred embodiments, the 
V w and V L polypeptides will generally have fewer than 125 
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amino acid residues, more usually fewer than about 120 
amino acid residues, while normally having greater than 60 
amino acid residues, usually greater than about 95 amino 
acid residues, more usually greater than about 100 amino 
acid residues. Preferably, the V„ will be from about 1 10 to 
about 230 amino acid residues in length while V £ will be 
from about 95 to about 214 amino acid residues in length. 
V H and V L chains sufficiently long to form Fabs are pre- 
ferred. 

The amino acid residue sequences will vary widely, 
depending upon the particular idiotype involved. Usually, 
there will be at least two cysteines separated by from about 
60 to 75 amino acid residues and joined by a disulfide bond. 
The polypeptides produced by the subject invention will 
normally be substantial copies of idiotypes of the variable 
regions of the heavy and/or light chains of 
immunoglobulins, but in some situations a polypeptide may 
contain random mutations in amino acid residue sequences 
in order to advantageously improve the desired activity. 

In some situations, it is desirable to provide for covalent 
cross linking of the V H and polypeptides, which can be 
accomplished by providing cysteine resides at the carboxyl 
termini. The polypeptide will normally be prepared free of 
the immunoglobulin constant regions, however a small 
portion of the J region may be included as a result of the 
advantageous selection of DNA synthesis primers. The D 
region will normally be included in the transcript of the V w . 

Typically the C terminus region of the V w and V £ 
polypeptides will have a greater variety of sequences than 
the N terminus and. based on the present strategy, can be 
further modified to permit a variation of the normally 
occurring V„ and chains. A synthetic polynucleotide can 
be employed to vary one or more amino acid in a hyper- 
variable region. 

In another embodiment, the invention contemplates a 
heterodimeric receptor molecule comprised of two polypep- 
tide chains, at least one of which is a polypeptide of this 
invention. Preferably, the polypeptide comprises a receptor 
domain derived from an immunoglobulin variable chain, 
more preferably a V w . More preferred is a heterodimeric 
receptor comprising receptor domains from both V H and V t 
chains. . 

E. Methods for Producing a Library 
1. General Rationale 

In one embodiment the present invention provides a 
system for the simultaneous cloning and screening of pre- 
selected ligand-binding specificities from gene repertoires 
using a single vector system. This system provides linkage 
of cloning and screening methodologies and has two 
requirements. First, that expression of the polypeptide 
chains of a heterodimeric receptor in an in vitro expression 
host such as E. coli requires coexpression of the two 
polypeptide chains in order that a functional heterodimeric 
receptor can assemble to produce a receptor that binds 
ligand. Second, that screening of isolated members of the 
library for a preselected ligand-binding capacity requires a 
means to correlate the binding capacity of an expressed 
receptor molecule with a convenient means to isolate the 
gene that encodes the member from the library. 

Linkage of expression and screening is accomplished by 
the combination of targeting of a fusion protein into the 
periplasm of a bacterial cell to allow assembly of a func- 
tional receptor, and the targeting of a fusion protein onto the 
coat of a filamentous phage particle during phage assembly 
to allow for convenient screening of the library member of 
interest. Periplasmic targeting is provided by the presence of 
a secretion signal domain in a fusion protein of this inven- 
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tion. Targeting to a phage particle is provided by the A gene can be identified as belonging to a repertoire of 

presence of a filamentous phage coat protein membrane conserved genes using several methods. For example, an 

anchor domain in a fusion protein of this invention. isolated gene may be used as a hybridization probe under 

The present invention describes in one embodiment a low stringency conditions to detect other members of the 

method for producing a library of DNA molecules, each 5 repertoire of conserved genes present in genomic DNA 

DNA molecule comprising a cistron for expressing a fusion usmg me methods described by Southern. J. MoL BioL 

protein on the surface of a filamentous phage particle. The 98:503 (1975). If the gene used as a hybridization probe 

method comprises the steps of (a) forming a ligation admix- hybridizes to multiple restriction endonuclease fragments of 

ture by combining in a ligation buffer (i) a repertoire of ^ gcnomc , that gene is a member of a repertoire of 

polypeptide encoding genes and (ii) a plurality of DNA JQ conscrvcd gcncs . 

expression vectors in linear form adapted to form a fusion immunoglobulins 

protein expressing cistron. and (b) subjecting die admixture ^ imjnuno ^ obulinSi or mo i ccu les. are a large 

to ligation conditions for a time ^.^^^^ family of molecules that include several types of molecules, 

repertoire of ge nes to become opcrauvely Linked (ligated) to * ^ molecule 

the plurality of vectors to form the library. . ^ ^ 7^7 X_ , /TT x a .-Jn \ 

In P thiseLx>o^ 15 »»^ y€aa *^^^ 

ing genes are in the form of o^bie-stranded (ds) DNA and with both a variable (V) and constant (C) region present on 

each [member of the repertoire has cohesive termini adapted each chain as shown in FIG. 1. Schematic diagrams of 

for directional ligation. In addition, the plurality of DNA human IgG heavy chain and human kappa light chain are 

expression vectors are each linear DNA molecules having shown in FIGS. 2A and 2B. respectively. Several different 

upstream and downstream cohesive termini that are (a) 20 regions of an immunoglobulin con tain conserved sequences 

adapted for directionaUy receiving the polypeptide genes in useful for isolating an immunoglobulin repertoire. Extensive 

a common reading frame, and (b) operatively linked to amino acid and nucleic acid sequence data displaying exem- 

respective upstream and downstream translatable DNA plary conserved sequences is compUed for immunoglobulin 

sequences. The upstream translatable DNA sequence molecules by Kabat et al.. in Sequences of Proteins of 

encodes a secretion signal, preferably a pelB secretion 25 Immunological Interest, National Institutes of Health, 

signal, and the downstream translatable DNA sequence Bethesda, Md.. 1987. 

encodes a filamentous phage coat protein membrane anchor The C region of the H chain defines the particular immu- 

as described herein for a polypeptide of this invention. The noglobulin type. Therefore the selection of conserved 

translatable DNA sequences are also operatively linked to sequences as defined herein from the C region of the H chain 

respective upstream and downstream DNA expression con- 30 results in the preparation of a repertoire of immunoglobulin 

trol sequences as defined for a DNA expression vector genes having members of the immunoglobulin type of the 

described herein. selected C region. 

The library so produced can be utilized for expression and The V region of the H or L chain typically comprises four 

screening of the fusion proteins encoded by the resulting framework (FR) regions each containing relatively lower 

library of cistrons represented in the library by the expres- 35 degrees of variability that includes lengths of conserved 

sion and screening methods described herein. sequences. The use of conserved sequences from the FR1 

2. Production of Gene Repertoires and FR4 (J region) framework regions of the V H chain is a 

A gene repertoire is a collection of different genes. preferred exemplary embodiment and is described herein in 
preferably polypep tide-encoding genes (polypeptide genes), the Examples. Framework regions are typically conserved 
and may be isolated from natural sources or can be generated 40 across several or all immunoglobulin types and thus con- 
artificially. Preferred gene repertoires are comprised of con- served sequences contained therein are particularly suited 
served genes. Particularly preferred gene repertoires com- for preparing repertoires having several irnmunoglobuiin 
prise either or both genes that code for the members of a types. 

dimeric receptor molecule. Major Histocompatibility Complex 

A gene repertoire useful in practicing the present inven- 45 The major histocompatibility complex (MHC) is a large 

tion contains at least 10 3 . preferably at least 10 4 , more genetic locus that encodes an extensive family of proteins 

preferably at least 10 5 . and most preferably at least 10 7 that include several classes of molecules referred to as class 

different genes. Methods for evaluating the diversity of a I. class n or class IH MHC molecules. Paul et al.. in 

repertoire of genes is well known to one skilled in the art . Fundamental Immunology. Raven Press. N.Y., pp. 303-378 

Thus, in one emrxxliment, the present invention contem- 50 ( 1984). 

plates a method of isolating a pair of genes coding for a Class I MHC molecules are a polymorphic group of 

dimeric receptor having a preselected activity from a rep- transplantation antigens representing a conserved. family in 

ertoire of conserved genes. Additionally, expressing the which the antigen is comprised of a heavy chain and a 

cloned pair of genes and isolating the resulting expressed non-MHC encoded ligjit chain. The heavy chain includes 

dimeric receptor protein is also described. Preferably, the 55 several regions, termed the N. CI, C2. membrane and 

receptor will be a heterodimeric polypeptide capable of cytoplasmic regions. Conserved sequences useful in the 

binding a ligand, such as an antibody molecule or immu- present invention are found primarily in the N, CI and C2 

nologically active portion thereof, a cellular receptor, or a regions and are identified as continuous sequences of 

cellular adhesion protein coded for by one of the members "invariant residues'* in Kabat et al, supra, 

of a family of conserved genes, i.e., genes containing a 60 Class II MHC molecules comprise a conserved family of 

conserved nucleotide sequence of at least about 10 nude- polymorphic antigens that participate in immune respoo- 

otides in length. siveness and are comprised of an alpha and a beta chain. The 

Exemplary conserved gene families encoding different genes coding for the alpha and beta chain each include 

polypeptide chains of a dimeric receptor are those coding for several regions that contain conserved sequences suitable for 

immunoglobulins, major histocompatibility complex and- 65 producing MHC class C alpha or beta chain repertoires, 

gens of class I or U. lymphocyte receptors, integrins and the Exemplary conserved nucleotide sequences include those 

like coding for amino acid residues 26-30 of the Al region. 
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residues 161-170 of the A2 region and residues 195-206 of been recently challenged results in producing a repertoire 

the membrane region, all of the alpha chain. Conserved that is not biased towards the production of high affinity V„ 

sequences are also present in the Bl, B2 and membrane and/or V L polypeptides. 

regions of the beta chain at nucleotide sequences coding for It should be noted the greater the genetic heterogeneity of 

amino add residues 41^*5. 150-162 and 200-209. respec- 5 the population of cells for which the nucleic acids are 

tively obtained, the greater the diversity of the immunological 

Lymphocyte Receptors and Cell Surface Antigens repertoire (comprising and V r coding genes) that will 

Lymphocytes contain several families of proteins on their be made available for screening according to the method of 

cell surfaces including the T-cell receptor. Thy- 1 antigen and the present invention. Thus, cells from different individuals, 

numerous T-cell surface antigens including the antigens 10 particularly those having an immunologically significant age 

defined by the monoclonal antibodies 01CT4 (leu3). OKT5/8 difference, and cells from individuals of different strains. 

(Ieu2), OKT3, OKT1 (leul), OKT 11 (leu5) OKT6 and races or species can be advantageously combined to increase 
OKT9. Paul, supra at pp. 458-479. . the heterogeneity (diversity) of a repertoire. 

The T-cell receptor is a term used for a family of antigen Thus, in one preferred ernbodiment, the source cells are 

binding molecules found on the surface of T-cells. The T-cell 15 obtained from a vertebrate, preferably a mammal, which has 

receptor as a family exhibits polymorphic binding specificity been immunized or partially immunized with an antigenic 

similar to immunoglobulins in its diversity. The mature iigand (antigen) against which activity is sought i.e.. a 

T-cell receptor is comprised of alpha and beta chains each preselected antigen. The immunize tioq can be carried out 

having a variable (V) and constant (C) region. The similari- conventionally. Antibody titer in the animal can be moni- 

ties that the T-cell receptor has to immunoglobulins in 20 tored to determine the stage of immunization desired, which 

genetic organization and function shows that T-cell receptor stage corresponds to the amount of enrichment or biasing of 

contains regions of conserved sequence. Lai et at. Nature, the repertoire desired. Partially immunized animals typically 

33 1*543-546 (1988). receive only one immunization and cells are collected from 

Exemplary conserved sequences include those coding for those animals shortly after a response is detected. Fully 

amino acid residues 84-90 of alpha chain, amino acid 25 immunized animals display a peak titer, which is achieved 

residues 107-115 of beta chain, and amino acid residues with one or more repeated injections of the antigen into the 

91-95 and 111-116 of the gamma chain. Rabat et at, supra. host mammal, normally at 2 to 3 week intervals. Usually 

279. three to five days after the last challenge, the spleen is 

Integrins And Adhesions removed and the genetic repertoire of the splenocytes. about 

Adhesive proteins involved in cell attachment are mem- 30 90% of which are rearranged B cells, is isolated using 

bers of a large family of related proteins termed integrins. standard procedures. See, Current Protocols in Molecular 

Integrins are heterodimers comprised of a beta and an alpha Biology. Ausubel et al.. eds.. John Wiley & Sons. NY. 

subunit. Members of the integrin family include the cell Nucleic acids coding for V„ and V L polypeptides can be 

surface glycoproteins platelet receptor GpHb-IIIa. vitronec- derived from cells producing IgA. IgD. IgE, IgG or IgM, 

tin receptor (VnR). fibronectin receptor (FnR) and the leu- 35 most preferably from IgM and IgG. producing cells, 

kocyte adhesion receptors LFA-1. Mac-1. Mo-1 and 60.3. Methods for preparing fragments of genomic DNA from 

Rouslahti et aL, Science, 238:491-497 (1987). Nucleic acid which immunoglobulin variable region genes can be cloned 

and protein sequence data demonstrates regions of con- as a diverse population arc well known in the art. See for 

served sequences exist in the members of these families. example Herrmann et al.. Methods In EnzymoL, 

particularly between the beta chain of GpOb-ma. VnR and 40 152:180-183. (1987); Frischauf, Methods In EnzymoL, 

FnR and between the alpha subunit of VnR, Mac-1. LFA-1. 152:183-190 (1987); Frischauf. Methods In EnzymoL, 

FnR and GpHb-TIIa. Suzuki et al.. Proc. NatL Acad. ScL 152:190-199 (1987); and DiLeila et al.. Methods In 

USA, 83:8614-8618. 1986; Ginsberg et al.. h BioL Chenu. EnzymoL, 152:199-212 (1987). (The teachings of the ref- 

262:5437-5440. 1987. erences cited herein are hereby incorporated by reference.) 

Various well known methods can be employed to produce 45 The desired gene repertoire can be isolated from either 

a useful gene repertoire. For instance. V„ and V L gene genomic material coritaining the gene expressing the vari- 

repertoires can be produced by isolating V^ and V^-coding able region or the messenger RNA (mRNA) which repre- 

mRNA from a heterogeneous population of antibody pro- sents a transcript of the variable region. The difficulty in 

ducing cells, i.e.. B lymphocytes (B cells), preferably rear- using the genomic DNA from other than non-rearranged B 

ranged B cells such as those found in the circulation or 50 lymphocytes is in juxtaposing the sequences coding for the 

spleen of a vertebrate. Rearranged B cells are those in which variable region, where the sequences are separated by 

immunoglobulin gene translocation, i.e., rearrangement, has introns. The DNA fragment(s) containing the proper exons 

occurred as evidenced by the presence in the cell of mRNA must be isolated, the introns excised, and the exons then 

with the immunoglobulin gene V, D and J region transcripts spliced in the proper order and in the proper orientation. For 

adjacently located thereon. Typically, the B cells are col- 55 the most part, this will be difficult, so that the alternatiye 

lected in a 1-100 ml sample of blood which usually contains technique employing rearranged B cells will be the method 

10 6 B cells/mL of choice because the V. D and J immunoglobulin gene 

In some cases, it is desirable to bias a repertoire for a regions have translocated to become adjacent so that the 

preselected activity, such as by using as a source of nucleic sequence is continuous (free of introns) for the entire 

acid cells (source cells) from vertebrates in any one of 60 variable regions. 

various stages of age, health and immune response. For Where mRNA is utilized the cells will be lysed under 

example, repeated immunization of a healthy animal prior to RNase inhibiting conditions. In one ernbodiment the first 

collecting rearranged B cells results in obtaining a repertoire step is to isolate the total cellular mRNA. Poly A+ mRNA 

enriched for genetic material producing a receptor of high can then be selected by hybridization to an oiigo-dT cellu- 

affinity. Mullinax et al.. Proc. NatL Acad. ScL USA 65 lose column. The presence of mRNAs coding for the heavy 

87:8095-8099 (1990). Conversely, collecting rearranged B and/or light chain polypeptides can then be assayed by 

cells from a healthy animal whose immune system has not hybridization with DNA single strands of the appropriate 
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genes. Conveniently, the sequences coding for the constant 
portion of the V w and V L can be used as polynucleotide 
probes, which sequences can be obtained from available 
sources. See for example. Early and Hood. Genetic 
Engineering, Setlow and Hollaender, eds.. Vol. 3. Plenum 
Publishing Corporation. NY. (1981), pages 157-188; and 
Kabat et aL. Sequences of Immunological Interest. National 
Institutes of Health, Bethesda, Md.. (1987). 

In preferred embodiments, the preparation containing the 
total cellular mRNA is first enriched for the presence of V„ 
and/or coding mRNA. Enrichment is typically accom- 
plished by subjecting the total mRNA preparation or par- 
tially purified mRNA product thereof to a primer extension 
reaction employing a polynucleotide synthesis primer as 
described herein. Exemplary methods for producing V H and 
gene repertoires using polynucleotide synthesis primers 
are described in PCX Application No. PCT/US 90/02836 
(International Publication No. WO 90/14430). Particularly 
preferred methods for producing a gene repertoire rely on 
the use of preselected oligonucleotides as primers in a 
polymerase chain reaction (PCR) to form PCR reaction 
products as described herein. 

In preferred embodiments, isolated B cells are immunized 
in vitro against a preselected antigen. In vitro immunization 
is defined as the clonal expansion of epitope-specific B cells 
in culture, in response to antigen stimulation. The end result 
is to increase the frequency of antigen-specific B cells in the 
immunoglobulin repertoire, and thereby decrease the num- 
ber of clones in an expression library that must be screened 
to identify a clone expressing an antibody of the desired 
specificity. The advantage of in vitro mimunization is that 
human monoclonal antibodies can be generated against a 
limitless number of therapeutically valuable antigens, 
including toxic or weak immunogens. For example, anti- 
bodies specific for the polymorphic determinants of tumor- 
associated antigens, rheumatoid factors, and histocompat- 
ibility antigens can be produced, which can not be elicited 
in immunized animals. In addition, it may be possible to 
generate immune responses which are normally suppressed 
in vivo. 

In vitro immunization can be used to give rise to either a 
primary or secondary immune response. A primary immune 
response, resulting from first time exposure of a B cell to an 
antigen, results in clonal expansion of epitope-specific cells 
and the secretion of IgM antibodies with low to moderate 
apparent affinity constants (lOMO 8 M~ l ). Primary immu- 
nization of human splenic and tonsillar lymphocytes in 
culture can be used to produce monoclonal antibodies 
against a variety of antigens, including cells, peptides, 
macromolecule. haptens, and tumor-associated antigens. 
Memory B cells from immunized donors can also be stimu- 
lated in culture to give rise to a secondary immune response 
characterized by clonal expansion and the production of 
high afiinity antibodies (>U? M~ J ) of the IgG isotype. 
particularly against viral antigens by clonally expanding 
sensitized lymphocytes derived from seropositive individu- 
als. 

In one embodiment peripheral blood lymphocytes are 
depleted of various cytolytic cells that appear to down- 
modulate antigen-specific B cell activation. When 
lysosome-rich subpopulations (natural killer ceils, cytotoxic 
and suppressor T cells, monocytes) are first removed by 
treatment with the ly sosmotropic methyl ester of leucine, the 
remaining cells (including B cells. T helper cells, accessory 
cells) respond antigen-specifically during in vitro immuni- 
zation. The lymph okinc requirements for inducing antibody 
production in culture are satisfied by a culture supernatant 
from activated, irradiated T cells. 
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In addition to in vitro immunization, cell panning 
(immunoaflfinity absorption) can be used to further increase 
the frequency of antigen-specific B cells. Techniques for 
selecting B cell subpopulations via solid-phase antigen 

5 binding are well established. Panning conditions can be 
optimized to selectively enrich for B cells which bind with 
high affinity to a variety of antigens, including cell surface 
proteins. Panning can be used alone, or in combination with 
in vitro immunization to increase the frequency of antigen- 

10 specific cells above the levels which can be obtained with 
either technique alone. Immunoglobulin expression libraries 
constructed from enriched populations of B cells are biased 
in favor of antigen-specific antibody clones, and thus, 
enabling identification of clones with the desired specifici- 

15 ties from smaller, less complex libraries. 
3. Preparation of Polynucleotide Primers 

The term "polynucleotide** as used herein in reference to 
primers, probes and nucleic acid fragments or segments to 
be synthesized by primer extension is defined as a molecule 

20 comprised of two or more deoxyribonucle otide or 
ribonucleotides, preferably more than 3. Its exact size will 
depend on many factors, which in turn depends on the 
ultimate conditions of use. 
The term "primer*' as used herein refers to a polynucle- 

25 otide whether purified from a nucleic acid restriction digest 
or produced synthetically, which is capable of acting as a 
point of initiation of nucleic acid synthesis when placed 
under conditions in which synthesis of a primer extension 
product which is complementary to a nucleic acid strand is 

30 induced. Le., in the presence of nucleotides and an agent for 
polymerization such as DNA polymerase, reverse tran- 
scriptase and the like, and at a suitable temperature and pH. 
The primer is preferably single stranded for maximum 
efficiency, but may alternatively be in double stranded form, 

35 If double stranded, the primer is first treated to separate it 
from its complementary strand before being used to prepare 
extension products. Preferably, the primer is a polydeoxyri- 
bonucleotide. The primer must be sufficiently long to prime 
the synthesis of extension products in the presence of the 

40 agents for polymerization. The exact lengths, of the primers 
will depend on may factors, including temperature and the 
source of primer. For example, depending on the complexity 
of the target sequence, a polynucleotide primer typically 
contains 15 to 25 or more nucleotides, although it can 

45 contain fewer nucleotides. Short primer molecules generally 
require cooler temperatures to form sufficiently stable hybrid 
complexes with template. 

The primers used herein are selected to be "substantially** 
complementary to the different strands of each specific 

50 sequence to be synthesized or amplified. This means that the 
primer must be sufficiently complementary to non-randomly 
hybridize with its respective template strand. Therefore, the 
primer sequence may or may not reflect the exact sequence 
of the template. For example, a non-complementary nucle- 

55 otide fragment can be attached to the 5* end of the primer, 
with the remainder of the primer sequence being substan- 
tially complementary to the strand. Such non- 
complementary fragments typically code for an endonu- 
clease restriction site. Alternatively, non-complementary 

60 bases or longer sequences can be interspersed into the 
primer, provided the primer sequence has sufficient comple- 
mentarily with the sequence of the strand to be synthesized 
or amplified to non-randomly hybridize therewith and 
thereby form an extension product under polynucleotide 

65 synthesizing conditions. 

Primers of the present invention may also contain a 
DNA-dependent RNA polymerase promoter sequence or its 
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complement. Sec for example. Krieg et al.. NucL Acids Res., V L coding DNA homolog. first primers are chosen to hybrid- 

12:7057-70 (1984); Studier et al.. J. Mol. Biol., ize with (i.e. be complementary to) a conserved region 

189:113-130 (1986); and Molecular Cloning: A Laboratory within the J region or constant region of immunoglobulin 

Manual Second Edition. Maniatis et aL. eds.. Cold Spring light chain genes and the like. Second primers become part 

Harbor. N.Y. (1989). 5 of the coding (plus) strand and hybridize to a nucleotide 

When a primer containing a DNA-dependent RNA poly- sequence conserved among minus strands. To produce the 

merase promoter is used the primer is hybridized to the V^coding DNA homologs. second primers arc therefore 

polynucleotide strand to be amplified and the second poly- chosen to hybridize with a conserved nucleotide sequence at 

nucleotide strand of the DNA-dependent RNA polymerase the 5' end of the V^oding immunoglobulin gene such as in 

promoter is completed using an inducing agent such as E. io that area coding for the leader or first framework region. It 

coli DNA polymerase L or the Klenow fragment of E. coU should be noted that in the amplification of both V^- and 

DNA polymerase. The starting polynucleotide is amplified V L -coding DNA homologs the conserved 5' nucleotide 

by alternating between the production of an RNA polynucle- sequence of the second primer can be complementary to a 

otide and DNA polynucleotide. sequence exogenously added using terminal deoxynucleoti- 

Primers may also contain a template sequence or repli- is dyl transferase as described by Loh et al.. Science, 

cation initiation site for a RNA-directed RNA polymerase. 243:217-220 (1989). One or both of the first and second 

Typical RNA-directed RNA polymerase include the QB primers can contain a nucleotide sequence defining an 

replicase described by Lizardi et al.. Biotechnology, endonuciease recognition site. The site can be heterologous 

6:1197-1202 (1988) RNA-directed polymerases produce to the immunoglobulin gene being amplified and typically 

large numbers of RNA strands from a small number of 20 appears at or near the 5' end of the primer, 

template RNA strands that contain a template sequence or When present, the restriction site-defining portion is typi- 

repUcation initiation site. These polymerases typically give cally located in a 5'-terminal noo-priming portion of the 

aoneniimon-foldaniplifi^^ P™«. The restriction site defined by the first pnmer is 

been described by Kramer et aL,7. Mol BioL 89:719-736 typically chosen to be one recognized by a restriction 

(1974) 25 enzyme that does not recognize the restriction site defined 

The polynucleotide primers can be prepared using any by the second primer, the objective being to be able to 

suitable method, such as. for example, the phosphotriester or produce a DNA molecule having cohesive termini that are 

phosphodiestcr methods see Narang et aL. Meth. EnzymoL, non-complementary to each other and thus allow directional 

68 90 (1979); U.S. Pat. No. 4356,270; and Brown et al., insertion into a vector. 

Merh. EnzymoL 68:109, ( 1979). 30 ^ onc embodiment, the present invention utilizes a set of 

The choice of a primer's nucleotide sequence depends on polynucleotides that form primers having a priming region 

factors such as the distance on the nucleic acid from the located at the 3^terminus of the primer. The priming region 

region coding for the desired receptor, its hybridization site is typically the 3'-most (T-terminal) 15 to 30 nucleotide 

on the nucleic acid relative to any second primer to be used. bases. The J-terminal priming portion of each primer is 

the number of genes in the repertoire it is to hybridize to. and 35 capable of acting as a primer to catalyze nucleic acid 

me ^ synthesis, i.e., initiate a primer extension reaction off its 3' 

a. Primers for Prc^udiig Immunoglobulin Gene Repertoires terminus. One or both of the primers can additionally 

V w and gene repertoires can be separately prepared contain a 5 -terminal (5-most) non-priming portion, i.e.. a 

prior to their utilization in the present invention. Repertoire region that does not participate in hybridization to repertoire 

preparation is typically accomplished by primer extension, 40 template. 

preferably by primer extension in a polymerase chain reac- In PCR, each primer works in combination with a second 

tion (PCR) format primer to amplify a target nucleic acid sequence. The choice 

To produce a repertoire of V^oding DNA homologs by of PCR primer pairs for use in PCR is governed by consid- 

primer extension, the nucleotide sequence of a primer is erations as discussed herein for producing gene repertoires, 

selected to hybridize with a plurality of immunoglobulin 45 That is. the primers have a nucleotide sequence that is 

heavy chain genes at a site substantially adjacent to the complementary to a sequence conserved in the repertoire. 

V^-coding region so that a nucleotide sequence coding for Useful V„ and V t rjrirning sequences are shown in Tables 5 

a functional (capable of binding) polypeptide is obtained. To and 6, herein below. 

hybridize to a plurality of different V^-coding nucleic acid 4. Polymerase Chain Reaction to Produce Gene Repertoires 

strands, the primer must be a substantial complement of a 50 The strategy used for cloning the V w and genes 

nucleotide sequence conserved among the different strands. contained within a repertoire will depend, as is well known 

Such sites include nucleotide sequences in the constant in the art on the type, complexity, and purity of the nucleic 

region any of the variable region framework regions, pref- acids making up the repertoire. Other factors include 

erably the third framework region, leader region, promoter whether or not the genes are contained in one or a plurality 

region. J region and the like. 55 of repertoires and whether or not they are to be amplified 

If the repertoires of V^coding and V L -coding DNA and/or mutagenized. 
homologs are to be produced by (PCR) amplification, two The V„- and V^-coding gene repertoires are comprised of 
primers Le., a PCR primer pair, must be used for each polynucleotide coding strands, such as mRNA and/or the 
coding strand of nucleic acid to be amplified. The first sense strand of genomic DNA. If the repertoire is in the form 
primer becomes part of the nonsense (minus or 60 of double stranded genomic DNA. it is usually first 
complementary) strand and hybridizes to a nucleotide denatured, typically by melting, into single strands. A rep- 
sequence conserved among V H (plus or coding) strands ertoire is subjected to a PCR reaction by treating 
within the repertoire. To produce V„ coding DNA (contacting) the repertoire with a PCR primer pair, each 
homologs, first primers are therefore chosen to hybridize to member of the pair having a preselected nucleotide 
(i e. be complementary to) conserved regions within the J 65 sequence. The PCR primer pair is capable of initiating 
region. CHI region, hinge region. CH2 region, or CH3 primer extension reactions by hybridizing to nucleotide 
region of immunoglobulin genes and the like. To produce a sequences, preferably at least about 10 nucleotides in length 
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and more preferably at least about 20 nucleotides in length, 
conserved within the repertoire. The first primer of a PCR 
primer pair is sometimes referred to herein as the "sense 
primer** because it hybridizes to the coding or sense strand 
of a nucleic acid. In addition, the second primer of a PCR 
primer pair is sometimes referred to herein as the "anti- sense 
primer" because it hybridizes to a non-coding or anti- sense 
strand of a nucleic arid. i.e.. a strand complementary to a 
coding strand. 

The PCR reaction is performed by mixing the PCR primer 
pair, preferably a predetermined amount thereof, with the 
nucleic acids of the repertoire, preferably a predetermined 
amount thereof, in a PCR buffer to form a PCR reaction 
admixture. The admixture is maintained under polynucle- 
otide synthesizing conditions for a time period, which is 
typically predetermined, sufficient for the formation of a 
PCR reaction product thereby producing a plurality of 
different V^oding and/or V^-coding DNA homologs. 

A plurality of first primer and/or a plurality of second 
primers can be used in each amplification, e.g.. one species 
of first primer can be paired with a number of different 
second primers to form several different primer pairs. 
Alternatively, an individual pair of first and second primers 
can be used. In any case, the amplification products of 
amplifications using the same or different combinations of 
first and second primers can be combined to increase the 
diversity of the gene library. 

In another strategy, the object is to clone the V^- and/or 
V^-coding genes from a repertoire by providing a polynucle- 
otide complement of the repertoire, such as the anti-sense 
strand of genomic dsDNA or the polynucleotide produced 
by subjecting mRNA to a reverse transcriptase reaction. 
Methods for producing such complements are well known in 
the art. 

The PCR reaction is performed using any suitable 
method. Generally it occurs in a buffered aqueous solution, 
Le., a PCR buffer, preferably at a pH of 7-9. most preferably 
about 8. Preferably, a molar excess (for genomic nucleic 
acid, usually about 10 6 :1 primer:template) of the primer is 
admixed to the buffer containing the template strand. A large 
molar excess is preferred to improve the efficiency of the 
process. 

The PCR buffer also contains the deoxy ribo nu cleotide 
triphosphates dATP, dCTP, dGTP. and dTTP and a 
polymerase, typically thermostable, all in adequate amounts 
for primer extension (polynucleotide synthesis) reaction. 
The resulting solution (PCR admixture) is heated to about 
90° C -100° C. for about 1 to 10 minutes, preferably from 
1 to 4 minutes. After this heating period the solution is 
allowed to cool to 54° C. which is preferable for primer 
hybridization. The synthesis reaction may occur at from 
room temperature up to a temperature above which the 
polymerase (inducing agent) no longer functions efficiently. 
Thus, for example, if DNA polymerase is used as inducing 
agent, me temperature is generally no greater than about 40° 
C. An exemplary PCR buffer comprises the following: 50 
mM KG; 10 mMTris-HCl; pH 8.3; 1.5 mM Mgcl 2 ; 0.001% 
(wt/vol) gelatin, 200 uM dATP; 200 pM dTTP; 200 uM 
dCTP; 200 uM dGTP; and 2.5 units Thermus aquaticus 
DNA polymerase I (U.S. Pat. No. 4,889,818) per 100 
microliters of buffer. 

The inducing agent may be any compound or system 
which will function to accomplish the synthesis of primer 
extension products, including enzymes. Suitable enzymes 
for this purpose include, for example, E. colt DNA poly- 
merase L Klenow fragment of £, coli DNA polymerase L T4 
DNA polymerase, other available DNA polymerases. 
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reverse transcriptase, and other enzymes, including heat- 
stable enzymes, which will facilitate combination of the 
nucleotides in the proper manner to form the primer exten- 
sion products which are complementary to each nucleic acid 

5 strand. Generally, the synthesis will be initiated at the 3' end 
of each primer and proceed in the 5' direction along the 
template strand, until synthesis terminates, producing mol- 
ecules of different lengths. There may be inducing agents, 
however, which initiate synthesis at the 5' end and proceed 

10 in the above direction, using the same process as described 
above. 

The inducing agent also may be a compound or system 
which will function to accomplish the synthesis of RNA 
primer extension products, including enzymes. In pref erred 

15 embodiments, the inducing agent may be a DNA-dependent 
RNA polymerase such as T7 RNA polymerase. T3 RNA 
polymerase or SP6 RNA polymerase. These polymerases 
produce a complementary RNA polynucleotide. The high 
turn over rate of the RNA polymerase amplifies the starting 

20 polynucleotide as has been described by Chamberlin et al„ 
The Enzymes, ed. P. Boyer, PP. 87-108, Academic Press. 
New York (1982). Another advantage of T7 RNA poly- 
merase is that mutations can be introduced into the poly- 
nucleotide synthesis by replacing a portion of cDNA with 

25 one or more mutagenic oligodeoxynucleotides 
(polynucleotides) and transcribing the partially-mismatched 
template directly as has been previously described by Joyce 
et aL, Nuc. Acid Res^ 17:711-722 (1989). Amplification 
systems based on transcription have been described by 

30 Gingeras et al., in PCR Protocols, A Guide to Methods and 
Applications, pp 245-252, Academic Press. Inc.. San Diego. 
Calif. (1990). 

If the inducing agent is a DNA-dependent RNA poly- 
merase and therefore incorporates ribonu cleo tide 

35 triphosphates, sufficient amounts of ATP. CTP. GTP and 
UTP are admixed to the primer extension reaction admixture 
and the resulting solution is treated as described above. 

The newly synthesized strand and its complementary 
nucleic acid strand form a double-stranded molecule which 

40 can be used in the succeeding steps of the process. 

The first and/or second PCR reactions discussed above 
can advantageously be used to incorporate into the receptor 
a preselected epitope useful in immunologically detecting 
and/or isolating a receptor. This is accomplished by utilizing 

45 a first and/or second polynucleotide synthesis primer or 
expression vector to incorporate a predetermined amino acid 
residue sequence into the amino acid residue sequence of the 
receptor. 

After producing V^- and V^-coding DNA homologs for a 

50 plurality of different V^- and V^-coding genes within the 
repertoires, the DNA molecules are typically further ampli- 
fied. While the DNA molecules can be amplified by classic 
techniques such as incorporation into an autonomously 
replicating vector, it is preferred to first amplify the mol- 

55 ecules by subjecting them to a polymerase chain reaction 
(PCR) prior to inserting them into a vector. PCR is typically 
carried out by thexmocyding i.e.. repeatedly increasing and 
decreasing the temperature of a PCR reaction admixture 
within a temperature range whose lower limit is about 10° C. 

60 to about 40° C. and whose upper limit is about 90° C. to 
about 100° C. The increasing and decreasing can be 
continuous, but is preferably phasic with time periods of 
relative temperature stability at each of temperatures favor- 
ing polynucleotide synthesis, denaturation and hybridiza- 

65 tion. 

PCR amplification methods are described in detail in U.S. 
Pat Nos. 4.683,192. 4.683.202. 4.800,159. and 4.965.188. 
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and at least in several texts including "PCR Technology: 
Principles and Applications for DNA Amplification". H. 
Erlich. ed.. Stockton Press, New York (1989); and "PCR 
Protocols: A Guide to Methods and Applications'*. Innis et 
al.. eds.. Academic Press, San Diego. Calif. (1990). 5 

In preferred embodiments only one pair of first and 
second primers is used per amplification reaction. The 
amplification reaction products obtained from a plurality of 
different amplifications, each using a plurality of different 
primer pain, are then combined 10 

However, the present invention also contemplates DNA 
homolog production via co-amplification (using two pairs of 
primers), and multiplex amplification (using up to about 8. 
9 or 10 primer pairs). 

In preferred embodiments, the PGR process is used not 
only to produce a library of DNA molecules, but also to 15 
induce mutations within the library or to create diversity 
from a single parental clone and thereby provide a library 
having a greater heterogeneity. First, it should be noted that 
the PCR process itself is inherently mutagenic due to a 
variety of factors well known in the art Second, in addition 20 
to the mutation inducing variations described in the above 
referenced U.S. Pat No. 4.683,195. other mutation inducing 
PCR variations can be employed. For example, the PCR 
reaction admixture, can be formed with different amounts of 
one or more of the nucleotides to be incorporated into the 25 
extension product. Under such conditions, the PCR reaction 
proceeds to produce nucleotide substitutions within the 
extension product as a result of the scarcity of a particular 
base. Similarly, approximately equal molar amounts of the 
nucleotides can be incorporated into the initial PCR reaction 30 
admixture in an amount to efficiently perform X number of 
cycles, and then cycling the admixture through a number of t 
cycles in excess of X, such as. for instance. 2X. 
Alternatively, mutations can be induced during the PCR 
reaction by incorporating into the reaction admixture nucle- 15 
otide derivatives such as inosine. not normally found in the 
nucleic acids of the repertoire being amplified. During 
subsequent in vivo amplification, the nucleotide derivative 
will be replaced with a substitute nucleotide thereby induc- 
ing a point mutation. 1 40 

5. Linear DNA Expression Vectors 

A DNA expression vector for use in a method of the 
invention for producing a library of DNA molecules is a 
linearized DNA molecule as described before having two 
(upstream and downstream) cohesive termini adapted for 45 
directional ligation to a polypeptide gene. 

A linear DNA expression vector is typically prepared by 
restriction endonuclease digestion of a circular DNA expres- 
sion vector of this invention to cut at two preselected 
restriction sites within the sequence of nucleotides, of the 50 
vector adapted for directional ligation to produce a linear 
DNA molecule having the required cohesive termini that are 
adapted for direction ligation. Directional ligation refers to 
the presence of two (a first and second) cohesive termini on 
a vector, or on the insert DNA molecule to be ligated into the 55 
vector selected, so that the termini on a single molecule are 
not complementary. A first terminus of the vector is comple- 
mentary to a first terminus of the insert and the second 
terminus of the vector is complementary to the second 
terminus of the insert 60 

6. Ligation Reactions to Produce Gene Libraries 

In preparing a library of DNA molecules of this invention, 
a ligation admixture is prepared as described above, and the 
admixture is subjected to ligation conditions for a time 
period sufficient for the admixed repertoire of polypeptide 65 
genes to ligate (become operatively linked) to the plurality 
of DNA expression vectors to form the library. 



Ligation conditions are conditions selected to favor a 
ligation reaction wherein a phosphodiester bond is formed 
between adjacent 3' hydroxyl and 5' phosphoryl termini of 
DNA. The ligation reaction is preferably catalyzed by the 
enzyme T4 DNA ligase. Ligation conditions can vary in 
time, temperature, concentration of buffers, quantities of 
DNA molecules to be ligated. and amounts of ligase, as is 
well known. Preferred ligation conditions involve maintain- 
ing the ligation admixture at 4 degrees Centigrade (4° C.) to 
12° C. for 1 to 24 hours in the presence of 1 to 10 units of 
T4 DNA ligase per milliliter (ml) and about 1 to 2 micro- 
grams (ug) of DNA. Ligation buffer in a ligation admixture 
typically contains 0.5M Tris-HCl (pH 7.4), 0.01M MgCl 2 . 
0.01M dithiothrietol. 1 mM spermidine. 1 mM ATP and 0.1 
rag/ml bovine scrum albumin (BSA). Other ligation buffers 
can also be used. 

Exemplary ligation reactions are described in Example 2. 
7. Preparation of Dicistronic Gene Libraries. 

In a particularly preferred embodiment the present inven- 
tion contemplates methods for the preparation of a library of 
dicistronic DNA molecules. A dicistronic DNA molecule is 
a single DNA molecule having the capacity to express two 
separate polypeptides from two separate cistrons. In pre- 
ferred embodiments, the two cistrons are operatively linked 
at relative locations on the DNA molecule such that both 
cistrons are under the transcriptional control of a single 
promoter. Each dicistronic molecule is capable of expressing 
first and second polypeptides from first and second cistrons. 
respectively, that can form, in a suitable host, a het- 
erodimeric receptor on the surface of a filamentous phage 
particle. 

The method for producing a library of dicistronic DNA 
molecules comprises the steps of: 

(a) Forming a first ligation admixture by combining in a 
ligation buffer. 

(i) a repertoire of first polypeptide genes in the form of 
dsDNA. each having cohesive termini adapted for 
directional ligation, and 

(ii) a plurality of DNA expression vectors in linear 
form* each having upstream and downstream first 
cohesive termini that are (a) adapted for directionaliy 
receiving the first polypeptide genes in a common 
reading frame, and (b) operatively linked to respec- 
tive upstream and downstream translatable DNA 
sequences. The upstream translatable DNA sequence 
encodes a pelB secretion si gnat the downstream 
translatable DNA sequence encodes a filamentous 
phage coat protein membrane anchor, and translat- 
able DNA sequences are operatively linked to 
respective upstream and downstream DNA expres- 
sion control sequences. 

(b) Subjecting the admixture to ligation conditions for a 
time period sufficient to operatively link the first 
polypeptide genes to the vectors and produce a plurality 
of circular DNA molecules each having a first cistron 
for expressing the first polypeptide. 

(c) Treating the plurality of circular DNA molecules under 
DNA cleavage conditions to produce a plurality of 
DNA expression vectors in linear form that each have 
upstream and downstream second cohesive termini that 
are (i) adapted for directionaliy receiving a repertoire of 
second polypeptide genes in a common reading frame, 
and (ii) operatively linked to respective upstream and 
downstream DNA sequences. The upstream DNA 
sequence is a translatable sequence encoding a secre- 
tion signaL the downstream DNA sequence has at least 
one stop codon in the reading frame, and the translat- 
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able DNA sequence is operatively linked to a DNA 
expression control sequence. 

(d) Forming a second ligation admixture by combining in 
a ligation buffer: 

(i) the plurality of DNA expression vectors formed in 
step (c). and 

(ii) the repertoire of second polypeptide genes in the 
form of dsDNA. each having cohesive termini 
adapted for directional ligation to the plurality of 
DNA expression vectors; and 

(e) Subjecting the second admixture to ligation conditions 
for a time period sufficient to operatively link the 
second polypeptide genes to said vectors and produce 
a plurality of circular DNA molecules each having the 
second cistron for expressing the second polypeptide, 
thereoy forming the library. In preferred embodiments 
a secretion signal is a pelB secretion signal, and the 
membrane anchor is derived from cpVTU as described 
herein. 

DNA expression vectors useful for practicing the above 
method are the dicistronic expression vectors described in 
greater detail before. 

In practicing the method of producing a library of dicis- 
tronic DNA molecules, it is preferred that the upstream and 
downstream first cohesive tennini do not have the same 
nucleotide sequences as the upstream and downstream sec- 
ond cohesive termini. In this einbodiment. the treating step 
(c) to linearize the circular DNA molecules typically 
involves the use of restriction endonucleases that are specific 
for producing said second termini, but do not cleave the 
circular DNA molecule at the sites that formed the first 
termini. Exemplary and preferred first and second termini 
are the termini defined by cleavage of pCBAK8 with Xho I 
and Spe I to form the upstream and downstream first termini 
and denned by cleavage of pCBAK8 with Sac I and Xba I 
to form the upstream and downstream second termini. In this 
embodiment, other pairs of cohesive termini can be utilized 
at the respective pairs of first and second termini, so long as 
the four termini are each distinct, non-complementary ter- 
mini. 

Methods of treating the plurality of circular DNA mol- 
ecules under DNA cleavage conditions to form linear DNA 
molecules are generally well known and depend on the 
nucleotide sequence to be cleaved and the mechanism for 
cleavage. Preferred treatments involve admixing the DNA 
molecules with a restriction endonudease specific for a 
endonuclease recognition site at the desired cleavage loca- 
tion in an amount sufficient for the restriction endonuclease 
to cleave the DNA molecule. Buffers, cleavage conditions, 
and substrate concentrations for restriction endonuclease 
cleavage are well known and depend on the particular 
enzyme utilized. Exemplary restriction enzyme cleavage 
conditions are described in Example 2. 

F. Phage Libraries 

The present invention contemplates a library of DNA 
molecules that each encode a fusion protein of this invention 
where the library is in the form of a population of different 
filamentous phage particles each containing one of the 
different rDNA molecules. By different ,DNA molecule is 
meant rDNA molecules differing in nucleotide base 
sequence encoding a polypeptide of this invention. 

Thus, a phage library is a population of filamentous 
phage, preferably fl, fd or M13 filamentous phage, each 
phage having packaged inside the particle a rDNA expres- 
sion vector of this invention. A preferred library is com- 
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prised of phage particles containing DNA molecules that 
encode at least 10 6 . preferably l(r and more preferably 
lO*^ 9 different fusion proteins of this invention. By different 
fusion proteins is meant fusion proteins differing in amino 

5 acid residue sequence. Where the packaged expression vec- 
tor encodes first and second polypeptides of an autogenously 
assembling receptor. e.g. V w and polypeptides that form 
a Fab. the library can also be characterized as containing or 
expressing a multiplicity of receptor specificities. Thus. 

10 preferred libraries express at least 10 3 . preferably at least 10 6 
and more preferably at least 10 7 different receptors, such as 
different antibodies. T cell receptors, in te grins and the like. 

As described herein, a particular advantage of a filamen- 
tous phage in the present invention is that the DNA molecule 

1 5 present in the phage particle and encoding one or both of the 
members of the heterodimeric receptor can be segregated 
from other DNA molecules present in the library on the basis 
of the presence of the particular expressed fusion protein the 
surface of the phage particle. 

20 Isolation (segregation) of a DNA molecule encoding one 
or both members of a heterodimeric receptor is conducted by 
segregation of the filamentous phage particle containing the 
gene or genes of interest away from the population of other 
phage particles coinprising the library. Segregation of phage 

25 particles involves the physical separation and propagation of 
individual phage particles away from other particles in the 
library. Methods for physical separation of filamentous 
phage particles to produce individual particles, and the 
propagation of the individual particles to form populations 

30 of progeny phage derived from the individual segregated 
particle are well known in the filamentous phage arts. 

A preferred separation method involves the identification 
of the expressed heterodimer on the surface of the phage 

35 particle by means of a ligand binding specificity between the 
phage particle and a preselected ligand. Exemplary and 
preferred is the use of "panning" methods whereby a sus- 
pension of phage particles is contacted with a solid phase 
ligand (antigen) and allowed to specifically bind (or immu- 

^ noreact where the heterodimer includes an immunoglobulin 
variable domain). After binding, non-bound particles are 
washed off the solid phase, and the bound phage particles are 
those that contain ligand-specific heterodimeric receptor 
(heterodimer) on their surface. The bound particles can then 

45 be recovered by elution of the bound particle from the solid 
phase, typically by the use of aqueous solvents having high 
ionic strength sufficient to disrupt the receptor-ligand bind- 
ing interaction. 
An alternate method for separating a phage particle based 

50 on the ligand specificity of the surf ace -expressed het- 
erodimer from a population of particles is to precipitate the 
phage particles from the solution phase by crosslinkage with 
the ligand. An exemplary and preferred crosslinking and 
precipitation method is described in detail in Example 4c. 

55 The use of the above particle segregation methods pro- 
vides a means for screening a population of filamentous 
phage particles present in a phage library of this invention. 
As applied to a phage library, screening can be utilized to 
enrich the library for one or more particles that express a 

60 heterodimer having a preselected ligand binding specificity. 
Where the library is designed to contain multiple species of 
heterodimers that all have some detectable measure of 
ligand binding activity, but differ in protein structure, 
antigenicity, ligand binding affinity or avidity, and the like, 

65 the screening methods can be utilized sequentially to first 
produce a library enriched for a preselected binding 
specificity, and then to produce a second library further 
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enriched by further screening comprising one or more 
isolated phage particles- Methods for measuring ligand 
binding activities, antigenicity and the like interactions 
between a ligand and a receptor are generally well known 
and are not discussed further as they are not essential 
features of the present invention. 

Thus, in one embodiment, a phage library is a population 
of particles enriched for a preselected ligand binding speci- 
ficity. 

In another emoodiment, a phage library comprises a 
population of particles wherein each particle contains at 
least one fusion protein of this invention on the surface of 
the phage particle. The actual amount of fusion protein 
present on the surface of a phage particle depends, in part, 
on the choice of coat protein membrane anchor present in the 
fusion protein. Where the anchor is derived from cpIEL there 
are typically about 1 to 4 fusion proteins per phage particle. 
Where the anchor is derived from the more preferred cpVTH. 
there is the potential for hundreds of fusion proteins on the 
particle surface depending on the growth conditions and 
other factors as discussed herein. Preferably, a phage particle 
in a library contains from about 10 to about 500 cpVHI- 
derived fusion proteins on the surface of each particle, and 
more preferably about 20 to 50 fusion proteins per particle. 
Exemplary amounts of surface fusion protein are shown by 
the electron micrographs described in Example 4a that 
describe particles having about 20 to 24 cpVUI-derived 
fusion proteins per particle. 

In another embodiment, the present invention contem- 
plates a population of phage particles that are the progeny of 
a single particle, and therefor all express the same het- 
erodimer on the particle surface. Such a population of phage 
are homogeneous and donally derived, and therefore pro- 
vide a source for expressing large quantities of a particular 
fusion protein. An exemplary clonalfy homogeneous phage 
population is described in Example 4. 

A filamentous phage particle in a library of this invention 
is produced by standard filamentous phage particle prepa- 
ration methods and depends on the presence in a DNA 
expression vector of this invention of a filamentous phage 
origin of replication as described herein to provide the 
signals necessary for (1) production of a single-stranded 
filamentous phage replicative form and (2) packaging of the 
replicative form into a filamentous phage particle. Such a 
DNA molecule can be packaged when present in a bacterial 
cell host upon introduction of genetic complementation to 
provide the filamentous phage proteins required for produc- 
tion of infectious phage particles. A typical and preferred 
method for genetic complementation is to infect a bacterial 
host cell containing a DNA expression vector of this inven- 
tion with a helper filamentous phage, thereby providing the 
genetic elements required for phage particle assembly. 
Exemplary helper rescue methods are described herein at 
Example 2, and described by Short et aL. Nuc. Acids Res., 
16:7583-7600 (1988). 

The level of heterodimeric receptor captured on the 
surface of a filamentous phage particle during the process of 
phage particle extrusion from the host cell can be controlled 
by a variety of means. In one embodiment, the levels of 
fusion proteins are controlled by the use of strong promoters 
in the first and second cistrons for expressing the 
polypeptides, such that transcription of the fusion protein 
cistrons occurs at a relative rate greater than the rate of 
transcription of the cpVm gene on the helper phage. In 
another embodiment, the helper phage can have an amber 
mutation in the gene for expressing cpVUI. such that less 
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wild-type cpVTH is transcribed in the host cell than fusion 
proteins, thereby leading to increased ratios of fusion protein 
compared to cpVTH during the extrusion process. 
In another embodiment, the amount of heterodimeric 

5 receptor on the phage particle surface can be controlled by 
controlling the timing between expression of fusion proteins 
and the superinfection by helper phage. After introduction of 
the expression vector, longer delay times before the addition 
of helper phage will allow for increased accumulation of the 

10 fusion proteins in the host celL 

EXAMPLES 

The following examples are intended to illustrate, but not 
15 limit, the scope of the invention. 

1. Construction of a Dicistronic Expression Vector 
for Producing a Heterodimeric Receptor on Phage 
Particles 

20 To obtain a vector system for generating a large number 
of Fab antibody fragments that can be screened directly, 
expression libraries in bacteriophage Lambda have previ- 
ously been constructed as described in Huse et al.. Science, 
246:1275-1281 (1989). These systems did not contain 

25 design features that provide for the expressed Fab to be 
targeted to the surface of a filamentous phage particle. 

The main criterion used in choosing a vector system was 
the necessity of generating the largest number of Fab frag- 

30 ments which could be screened directly. Bacteriophage 
Lambda was selected as the starting point to develop an 
expression vector for three reasons. First in vitro packaging 
of phage DNA was the most efficient method of reintroduc- 
ing DNA into host cells. Second, it was possible to detect 

35 protein expression at the level of single phage plaques. 
Finally, the screening of phage libraries typically involved 
less difficulty with nonspecific binding. The alternative, 
plasmid cloning vectors, are only advantageous in the analy- 
sis of clones after they have been identified. This advantage 

40 was not lost in the present system because of the use of a 
dicistronic expression vector such as pCombVTH. thereby 
permitting a plasmid containing the heavy chain, light chain, 
or Fab expressing inserts to be excised. 

45 a. Construction of Dicistronic Expression Vector 
pCOMB 

(i) Preparation of Lambda Zap™ II 

Lambda Zap™ II is a derivative of the original Lambda 
Zap (XTCC #40.298) that maintains all of the characteristics 

50 of the original Lambda Zap including 6 unique cloning sites, 
fusion protein expression, and the ability to rapidly excise 
the insert in the form of a phagemid (Bluescript SK-), but 
lacks the SAM 100 mutation, allowing growth on many 
Non-Sup F strains, including XLl-Blue. The Lambda Zap™ 

55 II was constructed as described in Short et aL. Nuc. Acids 
/?«., 16:7583-7600. 1988, by replacing the Lambda S gene 
contained in a 4254 base pair (bp) DNA fragment produced 
by digesting Lambda Zap with the restriction enzyme Nco L 
This 4254 bp DNA fragment was replaced with the 4254 bp 

60 DNA fragment containing the Lambda S gene isolated from 
Lambda gtlO (XTCC #40.179) after digesting the vector 
with the restriction enzyme Nco L The 4254 bp DNA 
fragment isolated from lambda gtlO was ligated into the 
original Lambda Zap vector using T4 DNA ligase and 

65 standard protocols such as those described in Current Pro- 
tocols in Molecular Biology. Ausubel et aL. eds.. John Wiley 
and Sons, NY. 1987, to form Lambda Zap™ IL 
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(ii) Preparation of Lambda Hc2 

To express a. plurality of V^coding DNA homologs in an 
£ colt host cell, a vector designated Lambda Hc2 was 
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solution was maintained at 37° C. for 30 minutes and then 
theT4 polynucleotide kinase was inactivated by maintaining 
the solution at 65° C. for 10 minutes. 



TABLE 3 



5' GGGCCCAAAITCTAITTCAAGGAGACAGTCAr 3* 

5' AATCAAA1ACCTAITGCCTACGGCAGCCOCTCGATT 3* 

5' GTIATTACTCGCnX3CX:CAACCAGCCAlpGCCC 3' 

5* CAGTTTCA(XTTXX}GCCATGGCTGGTTXXX5 3* 

5* C AGCGAGTIAAIAAC AATCCAGCGGCTGCCGTAGGC AATAG 3' 

5' GT^TTTCATTAIGACTGTCTCCITGAAAIAGAATTTCC 3' 

5* AGGTTG AAACTGCTCG AG ATTTCTAG ACTAGTTACCCGTAC 3" 

5' CGGAACGTCGTACGGGTAACTAGTCTAGAAATCTCGAG 3* 

5* GACGTITCCGGACIACGCTrcTTAAlAGAATrCG 3* 

5' TCGACGAATTCTATTAAGAACCGTAGTC 3*^ 



SEQ. 




ID. NO. - 


(22) 


Nl) 


(23) 


N2) 


(24) 


N3) 


(25) 


N6) 


(26) 


N7) 


(27) 


N8) 


(28) 


N9-4) 


(29) 


N10-5) 


(30) 


Nil) 


(31) 


N12) 



constructed. The vector provided the following: the capacity 
to place the V^-coding DNA homologs in the proper reading 
frame; a ribosome binding site as described by Shine et al. 
Nature, 254:34. 1975; a leader sequence directing the 
expressed protein to the periplasmic space designated the 
pelB secretion signal; a polynucleotide sequence that coded 
for a known epitope (epitope tag); and also a polynucleotide 
that coded for a spacer protein between the V^-coding DNA 
homolog and the polynucleotide coding for the epitope tag. 
Lambda Hc2 has been previously described by Huse et al.. 
Science, 246:1275-1281 (1989). 

To prepare Lambda HcZ a synthetic DNA sequence 
containing all of the above features was constructed by 
designing single stranded polynucleotide segments of 20-40 
bases that would hybridize to each other and form the double 
stranded synthetic DNA sequence shown in FIG. 3. The 
individual single-stranded polynucleotide segments are 
shown in Table 3. 

Polynucleotides N2. N3, N9-4, NIL N10-5, N6, N7 and 
N8 (Table 3) were kinased by adding 1 ul of each poly- 
nucleotide 0.1 micrograms/micToliter (ug/pl) and 20 units of 
T A polynucleotide kinase to a solution containing 70 mM 
Tris-HCL pH 7.6. 10 mM MgQ 2 . 5 mM dithiothreitol 
(DTT), 10 mM beta-mercaptoethanoK 500 micrograms per 
millili ter (ug/ml) bo vine serum albumin (BSA).The solution 
was maintained at 37 degrees Centigrade (37° C.) for 30 
minutes and the reaction stopped by maintaining the solution 
at 65° C. for 10 minutes. The two end polynucleotides, 20 
ng of polynucleotides Nl and polynucleotides N12, were 
added to the above kinasing reaction solution together with 
V\o volume of a solution containing 20.0 mM Tris-HCL pH 
7.4. 2.0 mM Mgd 2 aud 50.0 mM Nad This solution was 
heated to 70° C. for 5 minutes and allowed to cool to room 
temperature, approximately 25° C- over 1.5 hours in a 500 
ml beaker of water. During this time period all 10 poly- 
nucleotides annealed to form the double stranded synthetic 
DNA insert shown in FIG. 3. The individual polynucleotides 
were covalently linked to each other to stabilize the syn- 
thetic DNA insert by adding 40 ul of the above reaction to 
a solution containing 50 mM Tris-HCL pH 7-5, 7 mM 
MgCl 2 .l mM DTT. 1 mM adenosine triphosphate (ATP) and 
10 units of T4 DNA ligasc. This solution was maintained at 
37° C for 30 minutes and then the T4 DNA ligase was 
inactivated by maintaining the solution at 65° C. for 10 
minutes. The end polynucleotides were kinased by mixing 
52 ul of the above reaction, 4 ul of a solution containing 10 
mM ATP and 5 units of T4 polynucleotide kinase. This 



20 The completed synthetic DNA insert was ligated directly 
into the Lambda Zap™ II vector described in Example la(i) 
that had been previously digested with the restriction 
enzymes. Not I and Xho L The ligation mixture was pack- 
aged according to the manufacture's instructions using 

25 Gigapack II Gold packing extract available from Stratagene. 
La Jolla. Calif. The packaged ligation mixture was plated on 
XLl-Blue cells (Stratagene); Individual lambda plaques 
were cored and the inserts excised according to the in vivo 
excision protocol for Lambda Zap™ II provided by the 

30 manufacturer (Stratagene). This in vivo excision protocol 
moved the cloned insert from the Lambda Hc2 vector into a 
phagemid vector to allow easy for manipulation and 
sequencing. The accuracy of the above cloning steps was 
confirmed by sequencing the insert using the Sanger dideoxy 

35 method described in by Sanger et al., Proc. Natl. Acad. Set. 
USA 74:5463-5467. (1977) and using the manufacture's 
instructions in the AMV Reverse Transcriptase 35 S-AXP 
sequencing kit (Stratagene). The sequence of the resulting 
double- stranded synthetic DNA insert in the \ H expression 

40 vector (Lambda Hc2) is shown in FIG. 3. The sequence of 
each strand (top and bottom) of Lambda Hc2 is listed in the 
sequence listing as SEQ. ID. NO. 1 and SEQ. ID. NO. 2, 
respectively. The resultant Lambda Hc2 expression vector is 
shown in FIG. 4. 

45 (iii) Preparation of Lambda Lc2 

To express a plurality of V^-coding DNA homologs in an 
E. cod host cell, a vector designated Lambda Lc2 was 
constructed having the capacity to place the V^-coding DNA 
homologs in the proper reading frame, provided a ribosome 

50 binding site as described by Shine et al.. Nature, 25434 
(1975), provided the pelB gene leader sequence secretion 
signal that has been previously used to successfully secrete 
Fab fragments in E. coli by Lei et al.. J. Bac^. 169:4379 
(1987) and Better et aL. Science, 240:1041 (1988). and also 

55 provided a polynucleotide containing a restriction endonu- 
clease site for cloning. Lambda Lc2 has been previously 
described by Huse et al.. Science, 246:1275-1281 (1989). 

A synthetic DNA sequence containing all of the above 
features was constructed by designing single stranded poly- 

60 nucleotide segments of 20-60 bases that would hybridize to 
each other and form the double stranded synthetic DNA 
sequence shown in FIG. 5. The sequence of each individual 
single-stranded polynucleotide segment (01-08) within the 
double stranded synthetic DNA sequence is shown in Table 

65 4. 

Polynucleotides 02. 03. 04. 05. 06 and 07 (Table 4) were 
kinased by adding 1 Ml (0.1 ug/ul) of each polynucleotide 
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and 20 units of T 4 polynucleotide kinase to a solution 
containing 70 mM Tris-HCl. pH 7.6. 10 raM MgCl. 5 raM 
DHTT. 10 mM bcta-mercaptoethanol. 500 mg/ml of BSA. The 
solution was maintained at 37° C. for 30 minutes and the 
reaction stopped by maintaining the solution at 65° C far 10 
minutes. The 20 ng each of the two end polynucleotides. 01 
and 08. were added to the above kinasing reaction solution 
together with Vio volume of a solution containing 20.0 mM 
Tris-HCl. pH 7.4. 2.0 mM MgCI and 15.0 mM sodium 
chloride (NaCl). This solution was heated to 70° C. for 5 
minutes and allowed to cool to room temperature, approxi- 
mately 25° C. over 1.5 hours in a 500 ml beaker of water. 
During this time period all 8 polynucleotides annealed to 
form the double stranded synthetic DNA insert shown in 
FIG. 5. The individual polynucleotides were covalently 
linked to each other to stabilize the synthetic DNA insert by 
adding 40 ul of the above reaction to a solution containing 
50 ml Tris-HCl. pH 7.5. 7 ml MgCl. 1 mm DTT. 1 mm ATP 
and 10 units of T4 DNAligase. This solution was maintained 
at 37° C. for 30 minutes and then the T4 DNA ligase was 
inactivated by maintaining the solution at 65° Q for 10 
minutes. The end polynucleotides were kinased by mixing 
52 ul of the above reaction, 4 ul of a solution containing 10 
mM ATP and 5 units of T4 polynucleotide kinase. This 
solution was maintained at 37° C. for 30 minutes and then 
theT4 polynucleotide kinase was inactivated by m ai n tain in g 
the solution at 65° C. for 10 minutes. 

TABLE 4, 
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Shine -Dal garno ribosome binding site as shown in the 
sequence in FIG. 5 and in SEQ. ID. NO. 3. A Spe I 
restriction site is also present in Lambda Hc2 as shown in 
FIGS. 3 and 4 and in SEQ. ID. NO. 1. A combinatorial 

5 vector, designated pComb. was constructed by combining 
portions of Lambda Hc2 and Lc2 together as described in 
Example la(iv) below. The resultant combinatorial pComb 
vector contained two Spe I restriction sites, one provided by 
Lambda Hc2 and one provided by Lambda Lc2, with an 

to EcoR I site in between. Despite the presence of two Spe I 
restriction sites. DNA homologs having Spe I and EcoR I 
cohesive termini were successfully directionally ligated into 
a pComb expression vector previously digested with Spe I 
and EcoR I as described in Example lb below. The prox- 

15 unity of the EcoR I restriction site to the 3' Spe I site, 
provided by the Lc2 vector, inhibited the complete digestion 
of the 3* Spe I site. Thus, digesting pComb with Spe I and 
EcoR I did not result in removal of the EcoR I site between 
the two Spe I sites. 

20 The presence of a second Spe I restriction site may be 
undesirable for ligations into a pComb vector digested only 
with Spe I as the region between the two sites would be 
elirninated. Therefore, a derivative of Lambda Lc2 lacking 
the second or 3* Spe I site, designated Lambda Lc3. is 

23 produced by first digesting Lambda Lc2 with Spe I to form 
a linearized vector. The ends are filled in to form blunt ends 
which are ligated together to result in Lambda Lc3 lacking 



SEQ. 
ID. NO. 



(32) 01) 5' tgaatknaaactagtcgccaaggagacagtcat 3' 

(33) 02) 5' aatq aaatacctattgc ctacggc agccgctgg att 3' 

(34) 03) 5' gttattactcgctgcccaaccag<x:atggcc 3* 

(35) 04) 5' gagctcgtcagttctagagttaagcggccg 3* 

(36) 05) 5' GTATITCATrXTCACTtJICTC 

TTCAAGCT 3* 

(37) 06) 5' CAGCGAGTAATAAC AATCC AGCGGCTGC CXTTAGGCAATAG 3' 

(38) 07) 5* TGACGAGCTCGGCCATCGCTC<mOGG 3* 

(39) 08) 5* TCG ACGGCC GCTTAACTCTAG AAC 3* 



The completed synthetic DNA insert was ligated directly 
into the Lambda Zap™ TJ vector described in Example 
l(aXi) that had been previously digested with the restriction 45 
enzymes Sac I and Xho L The ligation mixture was pack- 
aged according to the manufacture's instructions using 
Gigapack II Gold packing extract (Stratagene). The pack- 
aged ligation mixture was plated on XLl-Blue cells 
(Stratagene). Individual lambda plaques were cored and the 50 
inserts excised according to the in vivo excision protocol for 
Lambda Zap™ II provided by the manufacturer 
(Stratagene). This in vivo excision protocol moved the 
cloned insert from the Lambda Lc2 vector into a plasmid 
phagemid vector allow for easy manipulation and scquenc- 55 
ing. The accuracy of the above cloning steps was confirmed 
by sequencing the insert using the manufacture's instruc- 
tions in the AMV Reverse Transcriptase 35 S-dATP sequenc- 
ing kit (Stratagene). The sequence of the resulting Lc2 
expression vector (Lambda Lc2) is shown in FIG. 5. Each 60 
strand is separately listed in the Sequence Listing as SEQ. 
ID. NO. 3 and SEQ. ID. NO. 4. The resultant Lc2 vector is 
schematically diagrammed in FIG. 6. 

A preferred vector for use in this invention, designated 
Lambda Lc3. is a derivative of Lainbda Lc2 prepared above. 65 
Lambda Lc2 contains a Spe I restriction site (ACTAGT) 
located 3' to the EcoR I restriction site and 5* to the 



a Spe I site. Lambda Lc3 is a preferred vector for use in 
constructing a combinatorial vector as described below, 
(iv) Preparation of pComb 

Fhagemids were excised from the expression vectors 
Lambda Hc2 or Lambda Lc2 using an in vivo excision 
protocol described above. Double stranded DNA was pre- 
pared from the phagemid-containing cells according to the 
methods described by Holmes et at. Anal. Biochenu, 
114:193 (1981). The phagemids resulting from in vivo 
excision contained the same nucleotide sequences for anti- 
body fragment cloning and expression as did the parent 
vectors, and are designated phagemid Hc2 and Lc2, corre- 
sponding to Lambda Hc2 and Lc2. respectively. 

For the construction of combinatorial phagemid vector 
pComb, produced by combining portions of phagemid Hc2 
and phagemid Lc2. phagemid Hc2 was first digested with 
Sac I to remove the restriction site located 5* to the LacZ 
promoter. The linearized phagemid was then blunt ended 
with T4 polymerase and ligated to result in a Hc2 phagemid 
lacking a Sac I site. The modified Hc2 phagemid and the Lc2 
phagemid were then separately restriction digested with Sea 
I and EcoR I and the linearized phagemids were ligated 
together at their respective cohesive ends. The ligated 
phagemid vector was then inserted into an appropriate 
bacterial host and transformants were selected on the anti- 
biotic ampicillin. 
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Selected ampicillin resistant transformants were screened 
for the presence of two Not I sites. The resulting ampicillin 
resistant combinatorial phagemid vector was designated 
pComb. the schematic organization of which is shown in 
FIG. 7. The resultant combinatorial vector. pComb. con- 
sisted of a DNA molecule having two cassettes to express 
two fusion proteins and having nucleotide residue sequences 
for the following operatively linked dements listed in a 5' to 
3' direction: a first cassette consisting of an inducible LacZ 
promoter upstream from the LacZ gene; a Not I restriction 
site; a ribosome binding site; a pelB leader; a spacer; a 
cloning region bordered by a 5' Xho and 3' Spe I restriction 
site; a decapeptide tag followed by expression control stop 
sequences; an EcoR I restriction site located 5' to a second 
cassette consisting of an expression control ribosome bind- 
ing site; a pelB leader, a spacer region; a cloning region 
bordered by a 5' Sac I and a 3' Xba I restriction site followed 
by expression control stop sequences and a second Not I 
restriction site. 

A preferred combinatorial vector for use in this invention, 
designated pComb2, is constructed by combining portions 
of phagemid Hc2 and phagemid Lc3 as described above for 
preparing pComb. The resultant combinatorial vector, 
pCombZ consists of a DNA molecule having two cassettes 
identical to pComb to express two fusion proteins identically 
to pComb except that a second Spe I restriction site in the 
second cassette is eliminated. 

b. Construction of Vectors pCombVffl and 
pCombm for Expressing Fusion Proteins Having a 
Bacteriophage Coat Protein Membrane Anchor 

Because of the multiple endonuclease restriction cloning 
sites, the pComb phagemid expression vector prepared 
above is a useful cloning vehicle for modification for the 
preparation of an expression vector of this invention. To that 
end, pComb is digested with EcoR I and Spe I followed by 
phosphatase treatment to produce linearized pComb. 

(i) Preparation of pCombVIH 

A PCR product produced in Example 2g and having a 
nucleotide sequence that defines a filamentous bacterioph- 
age coat protein vm (cpVTtt) membrane anchor domain and 
cohesive Spe I and EcoR I termini was admixed with the 
linearized pComb to form a ligation admixture. The cpVUI- 
membrane anchor-encoding PCR fragment was direction- 
ally ligated into the pComb phagemid expression vector at 
corresponding, cohesive termini, that resulted in forming 
pCombVUI (also designated pComb8). pCombVm contains 
a cassette defined by the nucleotide sequence shown in SEQ. 
ID. NO. 116 from nucleotide base 1 to base 208. and 
contains a pelB secretion signal operatively linked to the 
cpVUI membrane anchor. 

A preferred phagemid expression vector for use in this 
invention, designated either pComb2-Vm or pComb2-8. is 
prepared as described above by directionally ligating the 
cpvm membrane anchor-encoding PCR fragment into a 
pComb2 phagemid expression vector via Spe I and EcoR I 
cohesive terminii. The pComb2-8 has only one Spe I restric- 
tion site. 

(ii) Preparation of pComblH 

A separate phagemid expression vector was constructed 
using sequences encoding bacteriophage cpHI membrane 
anchor domain. A PCR product defining the cpITJ membrane 
anchor and Spe I and EcoR I cohesive termini was prepared 
as described for cpVUX the details of which are described in 
Example 2g. The cpIII- derived PCR product was then 
ligated into linearized pComb vector to form the vector 
pCombm (also designated pComb3). 
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A preferred phagemid expression vector for use in this 
invention, designated either pCorab2-m or pComb2-3. is 
prepared as described above by directionally ligating the 
cpni membrane anchor-encoding PCR fragment into a 
5 pComb2 phagemid expression vector via Spe I and Spe I 
cohesive terminii. The pComb2-3 has only one Spe I restric- 
tion site. 

c. Construction of pCBAK Vectors Having a 
Chloramphenicol Resistance Marker 

In order to utilize a different selectable marker gene, such 
as chloramphenicol acetyl transferase (CAT), for the selec- 
tion of bacteria transformed with a vector of this invention, 
expression vectors based on pComb were developed having 
15 a gene encoding CAT and are designated pCBAK vectors. 
The pCBAK vectors are prepared by combining portions of 
pCB and pComb. 

(i) Preparation of pCB 

pBlueScript phagemid vectors. pBC SK(-) and pBS SK(- 

20 ). (Stratagene), were modified and combined to generate a 
third vector designated pCB as described below. 

pBC SK(-). which contains a chloramphenicol resistance 
selectable marker gene, was digested with Bst BI and blunt 
ended with T4 polymerase. A second digestion with Pvu I 

25 allowed for the removal of a 1 kilobase (kb) fragment 
leaving a 2.4 kb linearized vector which retained the CAT 
selectable resistance marker gene, an inducible LacZ pro- 
moter upstream from the LacZ gene and a ColEl origin 
region. The 2.4 kb fragment was recovered. The pBS SK(-) 

30 vector was digested with Aat II and blunt ended with T4 
polymerase. A second digestion with Pvu I allowed for the 
isolation of an 800 base pair (bp) fragment containing the f 1 
origin of replication. Ligation of the pBS derived 800 bp f 1 
fragment with the 2.4 kb pBC fragment created a pCB 

35 precursor vector containing a Sac I site, an fl origin of 
replication, a CAT selectable resistance marker gene. ColEl 
origin, a multiple cloning site (MCS) flanked by T 3 and T 7 
promoters, and an inducible LacZ promoter upstream from 
LacZ gene. 

40 The pCB precursor vector was then digested with Sac I 
and blunt-ended with T4 polymerase. The T4 polymerase - 
treated pCB vector was then religated to form pCB vector 
and is lacking a Sac I site. 

(ii) Preparation of pCBAKO 

45 The pCB vector containing the CAT selectable resistance 
marker gene was digested with Sac II and Apa I and treated 
with phosphatase to prevent religation and to form linearized 
pCB vector. The pComb vector prepared in Example l(aXiv) 
was restriction digested with Sac II and Apa I to release a 

50 fragment containing nucleotide residue sequences starting 5' 
to the LacZ promoter and extending past the 3* end of the 
second Not I site. The Sac II and Apa I pComb DNA 
fragment was then directionally ligated into the similarly 
digested pCB vector to form phagemid expression vector 

55 pCBAKO. Preferred pCBAK expression vectors are con- 
structed with pComb2. The resultant pCBAK expression 
vector contains only one Spe I restriction site. 

(iii) Preparation of pCBAK8 

To prepare a pCBAK-based phagemid expression vector 
60 which encodes a bacteriophage coat protein membrane 
anchor domain in the expressed fusion protein. pCB 
phagemid cloning vector prepared in Example lc(ii) was 
linearized by digestion with Sac II and Apa L The pCom- 
bVm phagemid expression vector, prepared in Example 
65 lb(i). was restriction digested with Sac II and Apa I to form 
a fragment containing a nucleotide residue sequence starting 
5' to the LacZ promoter and extending past the 3* end of the 
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second Not 1 site. The fragment was directionally ligated 
into the linearized pCB cloning vector to form phagemid 
expression vector pCBAK8. 
<iv) Preparation of pCBAK3 

The phagemid expression vector. pCBAK3. for the 
expression of fusion protein having cpm membrane anchor 
domains, was similarly constructed by directionally ligating 
the Sac II and Apa I restriction digested fragment from 
pCombm with Sac II and Apa I linearized pCB cloning 
vector. 10 

2. Construction of Didstronic Expression Vectors 
for Expressing Anti-NPN Heterodimer on Phage 
Surfaces 

In practicing this invention, the heavy (Fd consisting of 15 
V„ and C w l) and light (kappa) chains (V L . CJ of antibodies 
are first targeted to the periplasm of E. coli for the assembly 
of heterodimeric Fab molecules. In order to obtain expres- 
sion of antibody Fab libraries on a phage surface, the 
nucleotide residue sequences encoding either the Fd or light 
chains must be operatively linked to the nucleotide residue 
sequence encoding a filamentous bacteriophage coat protein 
membrane anchor. Two preferred coat proteins for use in this 
invention in providing a membrane anchor are VHI and IH 
(cpvm and cpm respectively). In the Examples described 25 
herein, methods for operatively linking a nucleotide residue 
sequence encoding a Fd chain to either cpvm or cpm 
membrane anchors in a fusion protein of this invention are 
described. ^ 

In a phagemid vector, a first and second cistron consisting 
of translatable DNA sequences are operatively linked to 
form a didstronic DNA molecule. Each dstron in the 
didstronic DNA molecule is linked to DNA expression 
control sequences for the coordinate expression of a fusion 3J 
protein. Fd-cpVUI or Fd-cpm. and a kappa ligjit chain. 

The first dstron encodes a periplasmic secretion signal 
(pelB leader) operatively linked to the fusion protein, either 
Fd-cpVUI or Fd-cpm The second cistron encodes a second 
pelB leader operatively linked to a kappa light chain. The ^ 
presence of the pelB leader facilitates the coordinated but 
separate secretion of both the fusion protein and light chain 
from the bacterial cytoplasm into the periplasmic space. 

The process described above is schematically dia- 
grammed in FIG, 8. Briefly, the phagemid expression vector 45 
carries a chloramphenicol acetyl transferase (CAT) select- 
able resistance marker gene in addition to the Fd-cpVTH 
fusion and the kappa chain. The f 1 phage origin of replica- 
tion facilitates the generation of single stranded phagemid. 
The isopropyl thiogalactopyranoside (IPTG) induced 50 
expression of a dicistroaic message encoding the Fd-cpvm 
fusion (V„. C W1 , cpVm) and the light chain (V L * CJ leads 
to the formation of heavy and light chains. Each chain is 
delivered to the periplasmic space by the pelB leader 
sequence, which is subsequently deaved. The heavy chain is 55 
anchored in the membrane by the cpvm membrane anchor 
domain while the light chain is secreted into the periplasm. 
The heavy chain in the presence of light chain assembles to 
form Fab molecules. This same result can be achieved if, in 
the alternative, the light chain is anchored in the membrane so 
via a light chain fusion protein having a membrane anchor 
and heavy chain is secreted via a pelB leader into the 
periplasm 

With subsequent infection of E. coli with a helper phage, 
as the assembly of the filamentous bacteriophage progresses. 65 
the coat protein VHI is incorporated along the entire length 
of the filamentous phage particles as shown in FIGS. 8 and 
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9. If cpm is used, the accumulation occurs od the tail of the 
bacteriophage. The advantage of the utilization of mem- 
brane anchors from cpVm over cpm is two fold. Firstly, a 
multiplicity of binding sites, consisting of approximately 
5 2700 cpvm monomers assembled in a tubular array, exist 
along the particle surface. Secondly, the construct does not 
interfere with phage infectivity. 
a. Polynucleotide Selection 

The nucleotide sequences encoding the immunoglobulin 
protein CDR*s arc highly variable. However, there are 
several regions of conserved sequences that flank the V 
region domains of either the light or heavy chain, for 
instance, and that contain substantially conserved nucleotide 
sequences, i.e.. sequences that will hybridize to the same 
primer sequence. Therefore, polynucleotide synthesis 
(amplification) primers that hybridize to the conserved 
sequences and incorporate restriction sites into the DNA 
homolog produced that are suitable for operatively link i n g 
the synthesized DNA fragments to a vector were con- 
structed. More specifically, the primers are designed so that 
the resulting DNA homologs produced can be inserted into 
an expression vector of this invention in reading frame with 
the upstream translatable DNA sequence at the region of the 
vector containing the directional ligation means, 
(i) V H Primers 

For amplification of the V w domains, primers are 
designed to introduce cohesive termini compatible with 
directional ligation into trie unique Xho I and Spe I sites of 
the phagemid Hc2 expression vector. For example, the 3' 
primer (primer 12A in Table 5). was designed to be comple- 
mentary to the mRNA in the J w region. In all cases, the 5' 
primers (primers 1-10. Table 5) were chosen to be comple- 
mentary to the first strand cDNA in the conserved 
N-terminus region (antisense strand). Initially amplification 
was performed with a mixture of 32 primers (primer 1. Table 
5) that were degenerate at five positions. Hybridoma mRNA 
could be amplified with mixed primers, but initial attempts 
to amplify mRNA from spleen yielded variable results. 
Therefore, several alternatives to amplification using the 
mixed 5' primers were compared. 

The first alternative was to construct multiple unique 
primers, eight of which are shown in Table 5, corresponding 
to individual members of the mixed primer pool. The 
individual primers 2-9 of Table 5 were constructed by 
incorporating either of the two possible nucleotides at three 
of the five degenerate positions. 

The second alternative was to construct a primer contain- 
ing inosine (primer 10. Table 5) at four of the variable 
positions based on the published work of TakahashL et al.. 
Pnc. Natl Acad. ScL (USA.). 82:1931-1935, (1985) and 
Ohtsuka et al.,7. Biol Chem., 260:2605-2608, (1985). This 
primer has the advantage that it is not degenerate and, at the 
same time minimizes the negative effects of mismatches at 
the unconserved positions as discussed by Martin et aL. Nwc. 
Acids /?«., 13:8927 (1985). However, it was not known if 
the presence of inosine nucleotides would result in incor- 
poration of unwanted sequences in the cloned \ H regions. 
Therefore, inosine was not induded at the one position that 
remains in the amplified fragments after the cleavage of the 
restriction sites. As a result, inosine was not in the cloned 
insert. 

Additional V w amplification primers including the unique 
3* primer were designed to be complementary to a portion of 
the first constant region domain of the gamma 1 heavy chain 
mRNA (primers 16 and 17, Table 5). These primers will 
produce DNA homologs containing polynucleotides coding 
for amino acids from the V H and the first constant region 
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domains of the heavy chain. These DNA homologs can 
therefore be used to produce Fab fragments rather than an 

Additional unique 3* primers designed to hybridize to 
similar regions of another class of immunoglobulin heavy 
chain such as IgM. IgE and IgA are contemplated. Other 3* 
primers that hybridize to a specific region of a specific class 
of CH l constant region and arc adapted for transferring the 
V H domains amplified using this primer to an expression 
vector capable of expressing those V H domains with a 
different class of heavy or light chain constant region are 
also contemplated. 

As a control for amplification from spleen or hybridoma 
mRNA. a set of primers hybridizing to a highly conserved 
region within the constant region IgG, heavy chain gene 
were constructed. The 5' primer (primer 11. Table 5) is 
complementary to the cDNA in the C H 2 region whereas the 
J primer (primer 13. Table 5) is complementary to the 
mRNA in the C^3 region. It is believed that no mismatches 
were present between these primers and their templates. 

The primers used for amplification of heavy chain Fd 
fragments for construction of Fabs are shown at least in 
Table 5. Amplification was performed in eight separate 
reactions, each containing one of the 5' primers (primers 
2-9) and one of the 3' primers (primer 16). The remaining 
5* primers that have been used for amplification in a single 
reaction are either a degenerate primer (primer 1) or a primer 
that incorporates inosine at four degenerate positions 
(primer 10, Table 5. and primers 17 and 18, Table 6). The 
remaining 3* primer (primer 14. Table 6) has been used to 
construct F v fragments. Many of the 5' primers incorporate 
a Xho I site, and the 3* primers incorporate a Spe I restriction 
site for insertion of the V w DNA homolog into the phagemid 
Hc2 expression vector (FIG. 4). 

V H amplification primers designed to amplify human 
heavy chain variable regions are shown in Table 6. One of 
the 5' heavy chain primer contains inosine residues at 
degenerate nucleotide positions allowing a single primer to 
hybridize to a large number of variable region sequences. 
Primers designed to hybridize to the constant region 
sequences of various IgG mKNAs are also shown in Table 
6. 
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(ii) Primers 

The nucleotide sequences encoding the V L CDRs are 
highly variable. However, there are several regions of con- 
served sequences that flank the CDR domains including 
the J £ , framework regions and V L leader/promotor. 
Therefore, amplification primers were constructed that 
hybridized to the conserved sequences and incorporate 
restriction sites that allow cloning the amplified fragments 
into the phagemid Lc2 vector cut with Sac I and Xba L 

For amplification of the V L CDR domains, the 5* primers 
(primers 1-8 in Table 6) were designed to be complementary 
to the first strand cDNA in the conserved N-terminus region. 
These primers also introduced a Sac I restriction endonu- 
clease site to allow the DNA homolog to be cloned into 
the phagemid Lc2 expression vector. The 3' V L amplification 
primer (primer 9 in Table 6) was designed to be comple- 
mentary to the mRNA in the J L regions and to introduce the 
Xba 1 restriction endonuclease site required to insert the V L 
DNA homolog into the phagemid Lc2 expression vector 
(FIG. 6). 

Additional 3* amplification primers were designed to 
hybridize to the constant region of cither kappa or lambda 
mRNA (primers 10 and 11 in Table 6). These primers allow 
a DNA homolog to be produced containing polynucleotide 
sequences coding for constant region amino acids of either 
25 kappa or lambda chain. These primers make it possible to 
produce an Fab fragment rather than an F^ 

The primers used for amplification of kappa light chain 
sequences for construction of Fabs are shown at least in 
Table 6. Amplification with these primers was performed in 
30 5 separate reactions, each containing one of the 5' primers 
(primers 3-6. and 12) and one of the J primers (primer 13). 
The remaining 3' primer (primer 9) has been used to 
construct F v fragments. The 5' primers contain a Sac I 
restriction site and the 3' primers contain a Xba I restriction 
35 site. 

V L amplification primers designed to amplify human light 
chain variable regions of both the lambda and kappa iso- 
types are also shown in Table 6. 

All primers and synthetic polynucleotides described 
herein, including those shown in Tables 3-7 were either 
purchased from Research Genetics in Huntsville. Ala. or 
synthesized on an Applied Biosystems DNA synthesizer, 
model 381 A, using the manufacturer's instruction. 
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TABLE 5 



(1) 5-AOOT(CA3XC/A)A<G/AXrr(GaiCTCGAGrC(T/A)GG 3' 



(2) 
(3) 
(4) 
(5) 
(6) 
(7) 
(8) 
(°) 



5 'AGGTC CAGCTGCTC G AGTCTGG ,3* 
5 1 AGGTC CAGCTGCTC G AGTC AGG 3' 
5 AGGTC CAGC 1 1 L. 1 CG AGTCTGG 3* 
yAGGTCCAGCTTCTCGAGTCAGG 3" 
yAGGTCCAACTGCTCG AGTCTGG 3* 
yAGGTCCAACTGCTCGAGTCAGG T 
5AGGTCCAACTTCTCXjAGTCTGG 1 
yAGGTCCAACTTCTCGAGTCAGG 3' 



(10) 5*AGGTNNANCINCTCXjAGTCfiyA)GG 3' 



(11) 5'GCCCAAGGATrJTGCTCACC 3* 



degenerate 5' primer for the amplification 
of mouse and human heavy chain variable 
regions (Vh) 

Unique 5 primer for the impUficabon 

of mouse and human V H 

Unique 5* primer for the implication 

of mouse and human V K 

Unique 5 primer for the amplification 

of mouse and human V H 

Unique 5' primer for the amplification 

of mouse and human V H 

Unique 5* primer for the amplification 

of mouse and human V H 

Unique 5" primer for the amplification 

of mouse and *" iwg| V H 

Unique 5 1 primer for the implification 

of mouse and h nrrnqr> V H 

Unique 5* primer for the amplification 

of mouse and human V H 

S degenerate primer containing inosine at 

4 degenerate positions for amplification of 



5* primer for amplification in the C H 2 
region of mouse IgGl 
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TABLE 5-continued 



(1 2) yCTATTAGAATTCAACGGTAAC ACjKKJTGCCTTGGCCCC A 3* 
(12A) y CTATTAACTAGTAACtXTTAAC AGTGGTGCCTTG CCCCA 3' 

(13) 5'CTCAGTATGGTGGTTGTGC 3* 

(14) S'GCTACTAGTTTTGATTTCCACCTTGG 3* 

(15) 5'C AGCC ATGGCCG AC ATCC AGATG 3' 

(16) S'AATTTXACTAGTCACCTItXnTXrrGCrGGC 3* 

(17) SX^GCAACTAGTACAACCACAATCCCTGGGCACAATTTT 3* 

(18) 5'AGGCTTACTAGTACAATCCCTGGGCACAAT 3* 



3 primer for amplification of V H and 
introducing a 3' Eco Rl site 
3* primer for amplification of V H using 3' 
Spe I site 

3' primer for amplification in the C H 3 

region of mouse IgOl 

3* primer for amplification of mouse kappa 

light chain variable regions (V L ) 

5 primer for amplification of mouse kappa 

light chain variable regions 

Unique 3' primer for amplification of V H 

including part of the mouse gamma 1 first 

constant region 

Unique 3' primer for amplification of Fd 
including part of mouse IgGl first constant 
region and hinge region 
3' primer for amplifying mouse Fd including 
part of the mouse IgG first constant region 
and part of the hinge region 



TABLE 6 



(1 ) S CCACnrCCGAGCltXil 1 UTG ACTC AGG AATCT 3' 

(2) 5' CCAGTrCCGAGCTCGTGTTGACGCAGCCGCCC 3* 

(3) 5' ccagttccgagctogtgctcacccau i c rcc A 3* 

(4) y ccagttccgagctccagatgacccagtctcca 3' 
(5> y ccagatgtgagctcgtgatgacccagactoca y 

(6) 5'CCAGAlGr^GAGCTCG^CAIGACrCAC^C^CCA3 , 

(7) S CCAGATGTGAGCTCTTGATGACCCAAACTCAA 3" 

(8) 5* CCAGATGTGAGCTCGTGATAACCCAGGATGAA 3' 

(9) 5* GCAGCATTCTAGAGTTTCAGCTCCAGCTTGCC 3* 

(10) y CCGCCGTCTAGAACACTCATTCCTCTTGAAGCT 3* 

(11) 5 CCGCCGTCTAGAACATTCTGC AGGAGACAGACT J 

(12) S CCAGTICCGAGCTCGTOATGACACAGlCrCCA 3' 

(13) 5" GCGCCGTCTAG AATIAAC ACTC ATKX7I t» iTO AA 3* 

(14) 5" CTATXAAC^GXAACGGTAACAGTGGTGCCrTGCCCCA 3' 

(15) y AGGCT1ACXAGTACAATCCCTGGGCACAAT 3" 

(1 6) y GCCGCTCTAGAACACTCATTCCTGTTGAA 3* 

(17) 5' AGGTIIIAICnCTCGAGTtrrOC 3' 

(18) 5* AGGTOAlCTtCTCGAGTCAGC 3' 

'(19) y GTGC C AG AIGTG AGCTCGTG ATG AC CC AG* ICTCCA 3* 

(20) y TCCITCTAGATTACTAACACTCrrcCCCrGT^ 3* 

(21) y GC AT1U1AG ACTATTATG AAC ATTCTGTAGGGGC 3' 

(22) 5 CTGC ACAGGGTC CTGGGCCGAGCTCGTGGTG ACTCAG 3* 

(23) y AGTTCCAITIXXTCGAGTCTGG 3' 

(24) y GTGGGCATGrTGTGACriTGTCTC^ 3' 

(25) 5' ACKATCACTAGTACAAGATTroGGCTC 3* 



Unique 5' primer for the amplication 
of kappa light chain variable regions 
Unique S primer for the amplication 
of kappa light chain variable regions 
Unique T primer for the amplication 
of kappa light chain variable regions . 
Unique y primer for the amplication 
of kappa light chain variable regions 
Unique S primer for the amplication 
of kappa light chain variable regions 
Unique y primer for the amplication 
of kappa light chain variable regions 
Unique 5" primer for the amplication 
of kappa tight chain variable regions 
Unique 5* primer for the amplication 
of kappa tight chain variable regions 
Unique 3' primer for amplification of 
kappa tight chain variable regions 
Unique 3' primer for mouse kappa tight 
chain amplification including the 
constant region 

Unique 3' primer for mouse lambda light 
chain amptification including the 
constant region 
Unique y primer for V L 
amplification 

Unique 3' primer for amplification of 
kappa light chain 

Unique 3* primer for amplification of 
mouse F v 

Unique 3* primer for amplification 

of mouse IgG Fd 

Unique 3* primer for amplification 

of mouse kappa light chain 

Degenerate 5* primer conraming 

inosine at 4 degenerate positiona for 

amplifying mouse V H 

Degenerate 5' primer containing 

inosine at 4 degenerate positions for 

amplifying mouse V H 

Unique y primer for human and mouse 

kappa V L amplification 

Unique 3* primer for kappa 

V L amplification 

Unique 3* primer for human, mouse and 
rabbit lambda V L amplification 
Unique y primer for human lambda 
V L amplification 

5' degenerate primer for human V K 
amplification containing ^nr^rw 
at 3 degenerate positions 
Unique 3' primer for human V H 
amplification 

Unique 3* primer for human IgGl Fd 
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TABLE 6-continued 



(26) 5* AGCA3CACTAGTACAAGATTTGGGCTC 3* 

(27) S XCGTXjCACCTCCTCGAGTCGGG 3* 

(28) 5* AGGTCCAACTGCTCGAGTCTGG 3' 

(29) 5" AGGTGCAACTGCTCGAGTCGGO J 

(30) S TCCTTCTAG ATTACTAAC AC TCTCCC CTGTTG AA 3* 

(31) 5* CTGCACAGGGTCCTGGGCCGAGCTCCn'GGTGACTCAG 3' 

(32) S GCAITCTAGACTATTAAC^ITCTCnAGGGGC 3' 

(33) J' ACCCAAGGACACCCTCA1G 3" 

(34) 5" CTCAGTATGGTGGTTGTGC 3" 

(35) 5' GTCTCACTACnCTCCACCAACXXX:CCAlCCK3TC 3' 

(36) 5- AlAlACTAGTGAGACAGTOACCAGGGTTCCTrGGCCCXA 3" 

(37) 5" ACXnCTAGATTCCACCTIGGTCCC 3* 

(38) 5 1 GCAIACTAGTCTATTAAC^TTCIXnAGGGGC 3' 

(39) 5" CCGGAATTCTIATCATTIACCCGGAGA 3* 

(40) 5 TCTGCACTAGTrcGAATGGGCACATGCAG 3* 



amplification 

Unique 3' primer for amplification 
of human variable regions (V H ) 
Unique 3* primer for amplification 
of human variable regions (V H ) 
Unique 3' primer for amplification 
of human variable regions (V H ) 
Unique 3' primer for amplification 
of human variable regions (V H ) 
3' primer in human kappa light chain 
constant region 

5' primer for amplification of human 
lambda light chain variable regions 
3' primer in human lambda light chain 
constant region 

Control primer hybridizing to the 

human CH 2 region 

Control primer hybridizing to the 

human CH 3 region 

5' primer for amplifying human IgG 

heavy chain first constant region 

3' primer for amplifying human heavy 

chain variable regions 

3' primer for amplifying human kapp 

chain variable regions 

5* primer for amplifying human kappa 

light chain constant region 

3' primer located in the CH3 region of 

human IgGl to amplify the entire heavy 

chain 

3' primer for amplifying the Fd region 
of mouse IgM 



The 19 primers listed in Table 5 have been listed in the 
Sequence Listing and have been assigned the following 
SEQ. ID. NO.: 

(I) =SEQ. ID. NO. 40 
(2>=SEQ. ID. NO. 41 
(3>=SEQ. ID. NO. 42 
(4>=SEQ. ID. NO. 43 
(5>=SEQ. ID. NO. 44 
(6>=SEQ. ID. NO. 45 
C7)=SEQ. ID. NO. 46 
(8)=SEQ. ID. NO. 47 
(9>SEQ. ID. NO. 48 
(10)=SEQ. ID. NO. 49 

(II) =SEQ. ID. NO. 50 

(12) =SEQ. ID. NO. 51 
(12A)=SEQ. ID. NO. 52 

(13) =SEQ. ID. NO. 53 

(14) =SEQ. ID. NO. 54 

(15) =SEQ. ID. NO. 55 
(16>SEQ. ID. NO. 56 

(17) =SEQ. ID. NO. 57 

(18) =SEQ. ID. NO. 58 

The 40 primers listed as "{\y through "(40)" in Table 6 
have also been individually and sequentially listed in the 
Sequence Listing beginning with SEQ. ID. NO. 59 through' 
SEQ. ID. NO. 98. respectively. 

b. Preparation of a Repertoire of Genes Encoding Immuno- 
globulin Variable Domain 

Nitrophenylphosphonamidate (NPN) was selected as the 
ligand for receptor binding in preparing a heterodimeric 
receptor according to the methods of the invention. 

Keyhole limpet hemocyanin (KLH) was conjugated to 
NPN to form a NPN- KLH conjugate used for immunizing a 
mouse to produce an anti-NPN immune response and 
thereby provide a source of ligand specific heterodimeric 
receptor genes. 
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The NPN-KLH conjugate was prepared by admixing 250 
ul of a solution containing 2.5 mg of NPN in diraethylfor- 
mamide with 750 ul of a solution containing 2 mg of KLH 
in 0.01 Molar (M) sodium phosphate buffer (pH 7.2). The 
two solutions were admixed by slow addition of the NPN 
solution to the KLH solution while the KLH solution was 
being agitated by a rotating stirring bar. Thereafter the 
admixture was maintained at 4° C. for 1 hour with the same 
agitation to allow conjugation to proceed. The conjugated 
40 NPN-KLH was isolated from the nonconjugated NPN and 
KLH by gel filtration through Sephadcx G-25. The isolated 
NPN-KLH conjugate was injected into mice as described 
below. 

The NPN-KLH conjugate was prepared for injection into 

45 mice by adding 100 ug of the conjugate to 250 ul of 
phosphate buffered saline (PBS). An equal volume of com- 
plete Freund's adjuvant was added and emulsified the entire 
solution for 5 minutes. A 129 G, x + mouse was injected with 
300 ul of the emulsion. Injections were given subcutane- 

50 ously at several sites using a 21 gauge needle. A second 
immunization with NPN-KLH was given two weeks later. 
This injection was prepared as follows: 50 micrograms (ug) 
of NPN-KLH were diluted in 250 ul of PBS and an equal 
volume of alum was admixed to the NPN-KLH solution. 

55 The mouse was injected intraperitoneally with 500 ul of the 
solution using a 23 gauge needle. One month later the mice 
were given a final injection of 50 ug of the NPN-KLH 
conjugate diluted to 200 ul in PBS. This injection was given 
intravenously in the lateral tail vein using a 30 gauge needle. 

60 Five days after this final injection the mice were sacrificed 
and total cellular RNA was isolated from their spleens. 

Total cellular RNA was prepared from the spleen of a 
single mouse immunized with KLH-NPN as described 
above using the RNA preparation methods described by 

65 Chomczynski et aL. Anal Biochem^ 162:156-159 (1987) 
and using the RNA isolation kit (Stratagene) according to 
the manufacturer's instructions. Briefly, immediately after 
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removing the spleeo from the immunized mouse, the tissue polynucleotides, i.e.. greater than about 10 4 different 

was homogenized in 10 ml of a denaturing solution con- V w -coding genes, and contains a similar number of 

taining 4.0M guanine isothiocyanate. 0.25M sodium citrate V L -coding genes. Thus, the mRNA population represents a 

at pH 7.0. and 0.1M beta-mere apt oethanol using a glass repertoire of variable region-coding genes, 

homogenizer. One ml of sodium acetate at a concentration of 5 c . Preparation of DNA Homologs 

2M at pH 4.0 was admixed with the homogenized spleen. preparation for PCR amplification. mRNA prepared 

One ml of phenol that had been previously saturated with abovc is uscd as a template for cDNA synthesis by a primer 

H 2 0 was also admixed to the denaturing solution containing extension reaction. In a typical 50 \ii transcription reaction, 

the homogenized spleen. Two ml of a chloroform:isoamyl ug ^ lccn mRNA in water is first hybridized 

alcohol (24:1 v/v) mixture was added to this homogenate 1Q (anncalcd j witn 500 ng (50.0 pmol) of the 3' V„ primer 

The homogenate was mixed vigorously for ten seconds and (prtmer 12A Table 5). at 65° C. for five minutes, 

maintained on ice for 15 minutes. The homogenate was then Subsequcntl v the mixture is adjusted to 1.5 mM dXTP. 

transferred to a thick-walled 50 ml polypropylene cenm- dCT p M d(Jr ^ and dTTP, 40 mM Tris-HCl. pH 8.0. 8 mM 

fuged rube (Fisher Scientific ^ompany. Pittsburg Pa ) The ^ NaQ aQd 2 ^ s^Lc. Molo^y- 

SSS^ 15 MS hernia virus Reverse ^^ff^). 

l^^SXSS^ centrifugembe and mixed with 26 units, is added and the solution * mamtaoned for 1 hour 

an equal volume of isopropyl alcohol. This solution was at 37C. . 

maintained at -20° C. for at least one hour to precipitate the PCR amplification is performed in a 100 ul reaction 

RNA The solution containing the precipitated RNA was containing the products of the reverse transcription reaction 

centrifuged at 10.000xg for twenty minutes at 4° C. The 20 (approximately 5 ug of the cDNA/RNA hybrid). 300 ng of 

pelleted total cellular RNA was collected and dissolved in 3 3' V H primer (primer 12A of Table 5). 300 ng each of the 5 

ml of the denaturing solution described above. Three ml of V H primers (primers 2-10 of Table 5) 200 mM of a mixture 

isopropyl alcohol was added to the re-suspended total eel- of dNTP's, 50 mM KC1. 10 mM Tris-HCl pH 8.3. 15 mM 

lular RNA and vigorously mixed. This solution was main- MgCl 2 . 0.1% gelatin and 2 units of Thermus aquatic us (Taq) 

tained at -20° C for at least 1 hour to precipitate the RNA. 25 DNA polymerase. The reaction mixture is overlaid with 

The solution containing the precipitated RNA was centri- mineral oil and subjected to 40 cycles of amplification. Each 

fuged at 10 OOOxg for ten minutes at 4° C The pelleted RNA ampUfication cycle includes denaturation at 92° C. for 1 

was washed once with a solution containing 75% ethanoL rninute. annealing at 52° C. for 2 minutes and polynucleotide 

The pelleted RNA was dried under vacuum for 15 minutes synthesis by primer extension (elongation) at 72° C. for 1.5 

and then re-suspended in dimethyl pyrocarbonate (DEPG) 30 minutes. The amplified V^coding DNA homolog contain- 

treated (DEPC-H 2 0) H a O.. ing samples are then extracted twice with phenol/ 

Messenger RNA (mRNA) enriched for sequences con- chloroform, once with chloroform, ethanol precipitated and 

finin g long poly A tracts was prepared from the total are stored at -70° C. in 10 mM Tris-HCl, pH 7.5. and 1 mM 

cellular RNA using methods described in MoUcular Clon- EDTA. 

ing: A Laboratory ManimLU^^tX^^.CoUS^^ 35 Using unique 5' primers (2-9. Table 5), efficient 

Harbor. N.Y.. (1982). Briefly, one half of the total RNA V^-coding DNA homolog synthesis and amplification from 

isolated from a smgle immunized mouse spleen prepared as the spleen mRNA is achieved as shown by agarose gel 

described above was re-suspended in one ml of DEPC-H 2 0 electrophoresis, The amplified cDNA OV coding DNA 

and maintained at 65° C. for five minutes. One ml of 2xhigh homolog) was seen as a major band of the expected size (360 

salt loading buffer consisting of 100 mM Tris-HCl (Tris 40 bp). The amount the amplified V^oding polynucleotide 

[hydroxymethyl] amino methane hydrochloride), 1M fragment in each reaction is similar, indicating that all of 

sodium chloride (NaCl). 2.0 mM msomum emylene oiajTiine these primers were about equally efficient in initiating 

tetra-acetic acid (EDTA) at pH 7.5, and 0.2% sodium amplification. The yield and quality of the amplication with 

dodecyl sulfate (SDS) was added to the re- suspended RNA these primers is reproducible. 

and the mixture allowed to cool to room temperature. The 45 The primer containing inosine also synthesizes amplified 

mixture was then applied to an oligo-dT (Collaborative V„-coding DNA homologs from spleen mRNA 

Research Type 2 or Type 3) column that was previously reproducibly. leading to the production of the expected sized 

prepared by washing the oligo-dT with a solution containing fragment, of an intensity similar to that of the other ampli- 

0.1M sodium hydroxide and 5 mM EDTA and then equili- fied cDNAs. The presence of inosine also permits efficient 

orating the column with DEPC-H 2 0. The eluate was col- 50 DNA homolog synthesis and amplification, clearly indicat- 

lected in a sterile polypropylene tube and reapplied to the ing that such primers are useful in generating a plurality of 

same column after heating the eluate for 5 minutes at 65° C V„-coding DNA homologs. Amplification products 

The oligo dT column was then washed with 2 ml of high salt obtained from the constant region primers (primers 11 and 

loading buffer consisting of 50 mM Iris-HQ, pH 7.5, 500 13, Table 5) are more intense indicating that amplification 

mM sodium chloride. 1 mM EDTA at pH 7.5 and 0. 1% SDS. 55 was more efficient, possibly because of a higher degree of 

The oligo dT column was then washed with 2 ml of homology between the template and primers. Following the 

lxmedium salt buffer consisting of 50 mM Iris-HCL pH 7.5. above procedures, a V^-coding gene library is constructed 
100 mM. 1 mM EDTA and 0. 1% SDS. The messenger RNA from the products of eight amplifications, each performed 

was eiuted from the oligo dT column with 1 ml of buffer with a different 5' primer. Equal portions of the products 

consisting of 10 mM Tris-HQ. pH 7 J. 1 mM EDTA, at pH 60 from each primer extension reaction are mixed and the 

7.5. and 0.05% SDS. The messenger RNA was purified by mixed product is then used to generate a library of 

extracting this solution with phenol/chloroform followed by V^-coding DNA homolog-containing vectors, 

a single extraction with 100% chloroform. The messenger DNA homologs of the V L are also prepared from the 
RNA was concentrated by ethanol precipitation and purified mRNA prepared as described above. In preparation 

re-suspended in DEPC H 2 0. 65 for PCR amplification. mRNA prepared according to the 

The messenger RNA (mRNA) isolated by the above above examples is used as a template for cDNA synthesis, 
process contains a plurality of different V„ coding In a typical 50 ul transcription reaction, 5-10 ug of spleen 
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mRNA ia watcx is first annealed with 300 ng (50.0 pmol) of 
the 3* V L primer (primer 14, Table 5), at 65° C. for five 
minutes. Subsequently, the mixture is adjusted to 1.5 mM 
dATP. dCTP. dGTR and cTTTP. 40 mM Tris-HCl. pH 8.0. 8 
mM MgCl 2 . 50 mM NaCl. and 2 mM spermidine. Moloney- 
Murine Leukemia virus reverse transcriptase (Stratagene). 
26 units, is added and the solution is maintained for 1 hour 
at 37° C. The PCR amplification is performed in a 100 ul 
reaction containing approximately 5 ug of the cDNA/RNA 
hybrid produced as described above. 300 ng of the 3* V L 
primer (primer 14 of Table 5). 300 ng of the 5' primer 
(primer 16 of Table 5), 200 mM of a mixture of dNTP's. 50 
mM KCL 10 mM Tris-HCl, pH 83. 15 mM MgCl 2 , 0.1% 
gelatin and 2 units of Taq DNA polymerase. The reaction 
mixture is overlaid with mineral oil and subjected to 40 
cycles of amplification. Each amplification cycle includes 
denaturation at 92° C. for 1 minute, annealing at 52° C. for 
2 minutes and elongation at 72° C. for 1.5 minutes. The 
amplified samples are then extracted twice with phenol/ 
chloroform, once with chloroform, ethanol precipitated and 
are stored at -70° C. in 10 mM Tris-HCL 7.5 and 1 mM 
EDTA. 

d. Insertion of DNA Homologs into a DNA Expression 
Vector 

To prepare an expression library enriched in v w 
sequences. DNA homologs enriched in V w sequences are 
prepared according to Example 2c using the same set of 5* 
primers but with primer 12A (Table 5) as the 3' primer. The 
resulting PCR amplified products (2.5 ug/30 ul of 150 mM 
NaCl. 8 mM Tris-HCl. pH 7.5, 6 mM MgS0 4 . 1 mM DTT, 
200 ug/ml BSA) are digested at 37° C. with restriction 
enzymes Xho I (125 units) and Spe I (125 units). In cloning 
experiments which required a mixture of the products of the 
amplification reactions, equal volumes (50 ul, 1-10 ug 
concentration) of each reaction mixture are combined after 
amplification but before restriction digestion. The W H 
homologs are purified on a 1% agarose gel using the 
standard electro-elution technique described in Molecular 
Cloning A Laboratory Manual. Maniatis et aL. eds.. Cold 
Spring Harbor. N.Y.. (1982). After gel electrophoresis of the 
digested PCR amplified spleen mRNA. the region of the gel 
containing DNA fragments of approximate 350 bps is 
excised, electro- el u ted into a dialysis membrane, ethanol 
precipitated and re-suspended suspended in a TE solution 
containing 10 mM Tris-HCX pH 7.5 and 1 mM EDTA to a 
final concentration of 50 ngful. The resulting V„ DNA 
homologs represent a repertoire of polypeptide genes having 
cohesive termini adapted for directional ligation to the 
vector Lambda Hc2. Tnese prepared V H DNA homologs are 
then directly inserted by directional ligation into linearized 
Lambda Hc2 expression vector prepared as described below. 

The Lambda Hc2 expression DNA vector is prepared for 
inserting a DNA homolog by admixing 100 ug of this DNA 
to a solution containing 250 units each of the restriction 
endonucleases Xho I and Spe I (both from Boehringer 
Mannheim. Indianapolis, Ind.) and a buffer recommended 
by the manufacturer. This solution is maintained at 37 from 
1.5 hours.The solution is heated at 65° C. for 15 minutes top 
inactivate the restriction endonucleases. The solution is 
chilled to 30° C. and 25 units of heat-killable (HK) phos- 
phatase (Epicenter. Madison, Wis.) and CaCl 2 is admixed to 
it according to the manufacturer's specifications. This solu- 
tion is maintained at 30° C. for 1 hour. The DNA is purified 
by extracting the solution with a mixture of phenol and 
chloroform followed by ethanol precipitation. The Lambda 
Hc2 expression vector is now ready for ligation to the V w 
DNA homologs prepared in the above examples. These 
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prepared V H DNA homologs are then directly inserted into 
the Xho I and Spe I restriction digested Lambda Hc2 
expression vector that prepared above by ligating 3 moles of 
V H DNA homolog inserts with each mole of the Hc2 

5 expression vector overnight at 5° C Approximately 3.0x10 s 
plague forming units are obtained after packaging the DNA 
with Gigapack II Bold (Stratagene) of which 50% are 
recombinants. The ligation mixture containing the V H DNA 
homologs arc packaged according to the manufacturers 

to specifications using Gigapack Gold H Packing Extract 
(Stratagene). The resulting Lambda Hc2 expression libraries 
are then transformed into XLl-Blue cells. 

To prepare a library enriched in sequences. PCR 
amplified products enriched in sequences are prepared 

15 according to Example 2c. These DNA homologs are 
digested with restriction enzymes Sac I and Xba I and the 
digested V L DNA homologs arc purified on a 1% agarose gel 
as described above for* the V w DNA homologs to form a 
repertoire of ^polypeptide genes adapted for directional 

20 ligation. The prepared DNA homologs are then direc- 
tionally ligated into the Lambda Lc2 expression vector 
previously digested with the restriction enzymes. Sac I and 
Xba I as described for Lambda Hc2. The ligation mixture 
containing the DNA homologs is packaged to form a 

25 Lambda Lc2 expression library as described above and is 
ready to be plated on XLl-Blue cells, 
e. Randomly Confining V w and DNA Homologs on the 
Same Expression Vector 
The construction of a library containing vectors for 

30 expressing two cistrons that express heavy and light chains 
is accomplished in two steps. In the first step, separate heavy 
and light chain libraries are constructed in the expression 
vectors Lambda Hc2 and Lambda Lc2, respectively, as 
described using gene repertoires obtained from a mouse 

35 immunized with NPN-KLH. In the second step, these two 
libraries are combined at the antisymmetric EcoR I sites 
present in each vector. This resulted in a library of clones 
each of which potentially co-expresses a heavy and a light 
chain. The actual combinations are random and do not 

40 necessarily reflect the combinations present in the B-cell 
population in the parent animal 

The spleen mRNA resulting from the above immuniza- 
tions (Example 2b) is isolated and used to create a primary 
library of V w gene sequences using the Lambda Hc2 expres- 

45 sion vector. The primary library contains 13xl0 6 plaque- 
forming units (pfu) and can be screened for the expression 
of the decapeptide tag to determine the percentage of clones 
expressing V H and C H \ (Fd) sequences. The sequence for 
this peptide is only in frame for expression following the 

50 cloning of a Fd (or V H ) fragment into the vector. At least 
80% of the clones in the library express Fd fragments based 
on immunodetection of the decapeptide tag. 

The light chain library is constructed in the same way as 
the heavy chain and contains 2.5xl0 6 members. Plaque 

55 screening, using an anti-kappa chain antibody, indicates that 
60% of the library contained express light chain inserts. A 
small percentage of inserts results from incomplete dephos- 
phorylation of vector after cleavage with Sac I and Xba L 
Once obtained, the two libraries are used to construct a 

60 combinatorial library by crossing them at the EcoR I site. To 
accomplish the cross, DNA is first purified from each library. 

The Lambda Lc2 library prepared in Example 2d is 
amplified and 500 ug of Lambda Lc2 expression library 
phage DNA is prepared from the amplified phage stock 

65 using the procedures described in Molecular Cloning: A 
Laboratory Manual* Maniatis et al., eds.. Cold Spring 
Harbor. N.Y. (1982). Fifty ug of this amplified expression 
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library phage DNA is maintained in a solution containing 
100 units of MLu I restriction endonuclcase (Boehringer 
Mannheim. Indianapolis. Ind.) in 200 pi of a buffer supplied 
by the endonuclease manufacturer for 1.5 hours at 37° C. 
The solution is then extracted with a mixture of phenol and 
chloroform. The DNA is then ethanol precipitated and 
re-suspended in 100 ul of water. This solution is admixed 
with 100 units of the restriction endonuclease EcoR I 
(Boehringer) in a final volume of 200 ul of buffer containing 
the components specified by the manufacturer. This solution 
is maintained at 37° C. for 1.5 hours and the solution is then 
extracted with a mixture of phenol and chloroform. The 
DNA was ethanol precipitated and the DNA re-suspended in 
TE. 

The Lambda Hc2 expression library prepared in Example 
2d is amplified and 500 ug of Lambda Hc2 expression 
library phage DNA is prepared using the -methods detailed 
above. 50 ug of this amplified library phage DNA is main- 
tained in a solution containing 100 units of Hind HI restric- 
tion endonuclease (Boehringer) in 200 ul of a buffer supplied 
by the endonuclease manufacturer for 1.5 hours at 37° C. 
The solution is then extracted with a mixture of phenol and 
chloroform saturated with 0. 1M Tris-HCl. pH 7.5. The DNA 
is then ethanol precipitated and re-suspended in 100 ul of 
water. This solution is admixed with 100 units of the 
restriction endonuclease EcoR I (Boehringer) in a final 
volume of 200 ul of buffer containing the components 
specified by the manufacturer. This solution is maintained at 
37° C. for 1.5 hours and the solution is then extracted with 
a mixture of phenol and chloroform. The DNA is ethanol 
precipitated and the DNA re-suspended in TE. 

The restriction digested Hc2 and Lc2 expression libraries 
are ligated together. To that end, a DNA admixture consists 
of 1 ug of Hc2 and 1 ug of Lc2 phage library DNA is 
prepared in a 10 ul reaction using the reagents supplied in a 
ligation kit (Stratagene). The DNA admixture is warmed to 
45° C. for 5 minutes to melt any cohesive termini that may 
have reannealed. The admixture is then chilled to 0° C. to 
prevent religation. Bacteriophage T4 DNAligase (0.1 Weiss 
units which is equivalent to 0.02 units as determined in an 
exonuclease resistance assay) is admixed into the chilled 
DNA solution along with 1 ul of 5 mM ATP and 1 ul 
lOxbacteriophage T4 DNA ligase buffer (lOxbuffer is pre- 
pared by admixing 200 raM Tris-HCl. pH 7.6. 50 mM 
Mgd 2 .50 mM DTT. and 500 ug/ml BSA) to form a ligation 45 
admixture. After ligation for 16 hr at 4° C. 1 ul of the ligated 
the phage DNA is packaged with Gigapack Gold II pack- 
aging extract and plated on XL 1 -Blue cells prepared accord- 
ing to the manufacturers instructions to form a Lambda 



bound NPN. In. a typical procedure 30.000 phage were 
plated oq XLl-Blue cells and duplicate lifts with nitrocel- 
lulose were screened for binding to NPN coupled to l25 I 
labeled BSA. The BSA was iodinated following the 
Chloramine-T method as described by Bolton et al.. 
Biochem., 133:529-534 (1973). Duplicate screens of 80,000 
recombinant phage from the light chain library and a similar 
number from the heavy chain library did not identify any 
clones which bound the antigen. In contrast, the screen of a 
similar number of clones from the Fab expression library 
identified many phage plaques that bound NPN. This obser- 
vation indicates that under conditions where many heavy 
chains in combination with light chains bind to antigen the 
same heavy or light chains alone do not Therefore, in the 
case of NPN. it is believed that there are many heavy and 
light chains that only bind antigen when they are combined 
with specific light and heavy chains respectively. 

To assess the ability to screen large numbers of clones and 
obtain a more quantitative estimate of the frequency of 
antigen binding clones in the combinatorial library, one 
million phage plaques were screened and approximately 100 
clones which bound to antigen were identified. For six 
clones which were believed to bind NPN. a region of the 
plate containing the six positive and approximately 20 
surrounding bacteriophage plaques was selected and each 
plaque was cored, replated, and screened with duplicate lifts. 
As expected, approximately one in twenty of the phage 
specifically bound to antigen. Cores of regions of the plated 
phage believed to be negative did not give positives on 
replating. 

Clone 2b. one of the plaques which reacted with NPN. 
was excised according to an in vivo excision protocol where 
200 ul of phage stock and 200 ul of a F+ derivative of 
XLl-Blue (A<5oo=1.00) (Stratagene) were admixed with 1 ul 
of M13mp8 helper phage (lxiO 10 pfu/milliliter (ml)) and 
maintained at 37° C. for 15 minutes. After a four hour 
maintenance in Luria-Bertani (LB) medium and heating at 
70° C. for 20 minutes to heat kOl the XLl-Blue cells, the 
phagemids were re-infected into XLl-Blue cells and plated 
40 onto LB plates containing ampicillin. This procedure con- 
verted the cloned insert from the lambda Zap H vector into 
a plasmid vector to allow easy manipulation and sequencing 
(Stratagene). The phagemid DNA encoding the W H and part 
of the y L was then determined by DNA sequencing using the 
Sanger dideoxy method described in Sanger et al.. Proc. 
Natl Acad Set., 74:5463-5467 ( 1977) using a Sequenase kit 
according to manufacturer's instructions (US Biochemical 
Corp.. Cleveland. Ohio). The nucleotide residue sequence of 
Clone 2b Fd chain is listed in the Sequence Listing as SEQ. 
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phage library of dicistronic expression vectors capable of 50 ID. NO. 99. The nucleotide residue sequences of the kappa 



expressing heavy and light chains derived from the NPN 
immunized mouse. A portion of the clones obtained are used 
to determine the effectiveness of the combination, 
f. Selection of Anti-NPN Reactive HetercKliiner-Producing 
Dicistronic Vectors 

The combinatorial Fab expression library prepared above 
in Example 2a was screened to identify clones having 
affinity for NPN. To determine the frequency of the phage 
clones which co-expressed the light and heavy chain 
fragments, duplicate lifts of the light chain, heavy chain and 
combinatorial libraries were screened as above for light and 
heavy chain expression. In this study of approximately 500 
recombinant phage, approximately 60% co-expressed light 
and heavy chain proteins. 

All three libraries, the light chain, the heavy chain and the 
combinatorial, were screened to determine if they contained 
recombinant phage that expressed antibody fragments which 
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tight chain variable and constant regions are listed in the 
Sequence Listing as SEQ. ID. NO. 100 and SEQ. ID. NO. 
101. respectively. 

g. Preparation of a DNA Sequence Encoding a Filamentous 
Phage Coat Protein Membrane Anchor 

cpVm Membrane Anchor M13mpl8. a commercially 
available bacteriophage vector (Pharmacia. Piscataway, 
N J.), was used as a source for isolating the gene encoding 
cpVUL The sequence of the gene encoding the membrane 
anchor domain of cpVUJ listed in Sequence Listing as SEQ. 
ID. NO. 102, was modified through PCR amplification to 
incorporate the restriction endonuclease sites, Spe I and 
EcoR L and two stop codons prior to the EcoR I site. The 
corresponding amino acid residue sequence of the mem- 
brane anchor domain of cpVm is listed as SEQ. ID. NO. 17. 

To prepare a modified cpVITJ, replicative form DNA from 
M13mpl8 was first isolated. Briefly, into 2 ml of LB 
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(Luria-Bertani medium). 50 ul of a culture of a bacterial of which are listed in Table 7 below, were used in the PGR 

strain carrying an F episome (JM107. JM109 or TGI) was reaction to amplify the mature gene for cpVUI member 

admixed with a one tenth suspension of bacteriophage anchor domain and incorporate the two cloning sites. Spe I 

particles derived from a single plaque. The admixture was and L For me rcacl j oni 2 ul containing 1 

incubated for 4 to 5 hours at 37° C.wto^^ 5 nanogram (ng) of M 13mpl8 replicative form DNA was 

The admixture was then centrifuged at 12 OOOxg for 5 admixed with 10 ul of 10xPCR buffcr purchased commer- 

minutes to pellet the infected bacteria. After the supernatant . a . «,.„ x ;» « n < m t 

Z removeTthe pellet was resuspended by vigorous vor- p™^*^**™ « L Z 1 s j£ 

toeing in 100 ul of ice-cold solution 1 Solution I was microfuge tube. To the DNA admixture. 8 ul of a 2.5 mM 

prepared by admixing 50 mM glucose. 10 mM EDTA and 25 10 solution of dNTPs (dATP. dCTP. dGTP. dTTP) was admixed 

mM Tris-HQ. pH 8.0. and autoclaving for 15 minutes. to result in a final concentration of 200 microraolar (uM). 

To the bacterial suspension. 200 ul of freshly prepared Three ul (equivalent to 60 picomoles (pM)) of the 5' forward 
Solution II was admixed and the tube was rapidly inverted AK 5 primer and 3 ul (60 pM) of the 3' backward AK 6 
five times. Solution II was prepared by admixing 0.2N primer was admixed into the DNA solution. To the 
NaOH and \% SDS. To the bacterial suspension. 150 ul of is adn^t^e. 73 ui 0 f sterile water and 1 ul/5 units of poly- 
ice-cold Solution III was admixed and the tube was vortexed mer & e (pj-omega Biotech) was added. Two drops of mineral 
gently in an inverted position for 10 seconds for to disperse oU wefe . d on of ^ aiin ^ t ^ t ^ 40 rounds of 
Solution HI through the viscous bacterial lysate. Solution m ^ m a thermocycler were performed. The 
was prepared by admixing 60 ml of 5M podium acetate. le consisted of 52 . c for 2 minutes. 72" C. 
11.5 ml of glacijd acetic acid and 28 inj Id . «^ 20 fo?i.5 minutesand 91° C. for 2 minutes. The resultant PGR 
resultant bactenalrysate was ^^onice for 5 ennutes mcmbnm e anchor domain DNA fragment 
followed bv centrifueation at 12.000xg tor 5 minutes at 4 ^ 

cTa^ofoge. lie resultant supernatant was recovered from M 13mpl8 containing samples were then purified with 

and transferred to a new tube. To tTsupexnatant was added Gene Clean (BIO101, JoUa COif), exacted twice with 

an equal volume of phenol: chloroform and the admixture 25 phenoVchlorofornL once with chloroform followed by etha- 

was vortexed. The admixture was then centrifuged at nol precipitation and were stored at -70° C. in 10 mM 

12.000xg for 2 minutes in a microfuge. The resultant super- Tris-HQ. pH 7.5. and 1 .mM EDTA. 

TABLE 7 



SEQ. 

ID NO. Primer 



(103) 1 AK 5 (F) S GTGCCCAGGGATTCn'ACrrAGTXjCTGAOGGTCACG^^ J 

(104) 3 AK 6 (B) 5* AUHJ UAA1 1L ! JLAJLAU TUXJCl 1 lLUAtAilUAA i' 

(105) 3 Hc3 (F) 5* AOOTCCAGCTTCTCGAGTCTGG 3" 

(106/ AK1 (B) S GTCACCCTC AGCAC TAGTAC AATC CCTGGGC AC 3' 
(107f G-3 (F) 5 GAUAL UAL TXrjIGGTCXXX^jlUUC llH-VAl li: 
O'l 1 Iti 1UAAIATCOT T 

(108) 6 G-3(B) 5 1 LAC LAUC 1 ALjCAXAAlAAC GG AAIACCCAAAA 

GAACTGG 3' 

(109) 7 LAC-F 5* 1A1UL' 1AU C1AGTAAC ACGACAGG 1' 1 'ACCCGAC 

TGGTT ~~ 

<110) B LAC-B 3' ACCGAGCT CGAATTC GTAATCATTXjIC 3' 

F Forward Primer 
B Backward Primer 

'From S to 3': tl* overlapping sequence for C„l 3' end is double under lined; the Spe I 
restriction site sequerce is single underlined; the overlapping sequence for cpVTH is double 
underlined. 

7 EcoR I restriction site sequence is single underlined 
*Xho I restriction site sequence is underlined 

J *From5' to 3': the overlapping sequence for cpvm is double underlined; the Spe I restriction 
site sequence is single underlined; the overlapping sequence for C„l 3' end is double 
underlined. 

^From 5' to 3*: Spe I restriction site sequence is single underlined; the overlapping sequence 
with the 5' end of cpIH is double underlined 

•From 5* to 3': Nhe I restriction site sequence is single underlined; (he overlapping sequence 

with 3* end of cpHI is double underlined. 

''From 5* to 3': overlapping sequence with the 3* end of cpffl is double undrrlmed; Nhe I 
restriction sequence begins with the nucleotide residue "G" at position 4 and extends 5 more 
resklues = GCTAGC. 

•EcoR I restriction she sequence is single i mnVrlnird 



natant was transferred to a new tube and the double-stranded To verify amplification of the modified cpVIII membrane 

bacteriophage DNA was precipitated with 2 volumes of anchor domain, the PCR purified DNA products were eiec- 

ethanol at room temperature. After allowing the adinixture tropboresed in a \% agarose geL The expected size of the 

to stand at room temperature for 2 minutes, the admixture cpvm was approximately 150 base pairs. The area in the 

was centrifuged to pellet the DNA. The supernatant was ^ containin ^ mo dified cpVTH DNA fragment was 

removed and *e pellet Xl^rTO ^ from * e ^ as dcscribcd abovc ' Thc sc< * ucncc 

pended uq 22 ul of Tns-HCL pH 7.6. and 10 mM K17IA (Tfc). . j.- . um n M A «__„, „ 

The double-stranded M13mpl8 replicative form DNA 6J of (he isolated modified cpVUI DNA fragment is listed as 

was then used as a template for PCR. Primers. AK 5 (SEQ. SEQ. ED. NO. 1 1 1. The isolated cpVffl DNA fragment was 

ID. NO. 103) and AK 6 (SEQ. ED. NO. 104). the sequences then admixed with a similarly prepared fragment of modi- 
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fied Fd as described below in Example 2i in order to form 
a DNA segment encoding the fusion protein Fd-cpVUL 

cpIH Membrane Anchor: M13mpl8 was also used as a 
source for isolating the gene encoding the membrane anchor 
domain at cpm. the sequence of which is listed in the 
Sequence Listing as SEQ. ID. NO. 112. The amino acid 
residue sequence of membrane anchor domain cpm is listed 
in SEQ. II). NO. 16. M13mpl8 repiicative form DNA was 
prepared as described above and used as a template for PCR 
for amplifying the mature gene for cpm membrane anchor 
domain and incorporating the two cloning sites. Spe I and 
EcoR I. 

The primer pair. G-3(F) (SEQ. ID. NO. 107) and G-3(B) 
(SEQ. ID. NO. 108) listed in Table 7. was used in PCR as 
performed above to incorporate Spe I and Nhe I restriction 
sites. The resultant PCR modified cpm DNA fragment was 
verified and purified as described above. The sequence of the 
PCR modified cpm membrane anchor domain DNA frag- 
ment is listed in the Sequence Listing as SEQ. ID. NO. 113. 
A second PCR amplification using the primer pairs. Lac-F 
(SEQ. ID. NO. 109) and Lac-B (SEQ. ID. NO. 110) listed 
in Table 7. was performed on a separate aliquot of M13mpl8 
repiicative form template DNA. The primers used for this 
amplification were designed to incorporate an overlapping 
sequence with the nucleotides encoding the membrane 
anchor region of cpm. and the adjacent Nhe I site along with 
a sequence encoding a LacZ promoter region 5" to an EcoR 
I restriction site. The reaction and purification of the PCR 
product performed as described above. The sequence of the 
resultant PCR modified cpm DNA fragment having Nho I 
and EcoR I restriction sites is listed in the Sequence Listing 
as SEQ. ID. NO. 114. 

The products of the first and second PCR amplifications 
were then re combined at the nucleotides corresponding to 
cpm membrane anchor overlap and Nhe I restriction site 
and subjected to a second round of PCR using the G3-F 
(SEQ. ID. NO. 107) and Lac-B (SEQ. ID. NO. 110) primer 
pair to form a recombined PCR DNA fragment product 
consisting of the following: a 5' Spe I restriction site; a cpm 
DNA membrane anchor domain beginning at the nucleotide 
residue sequence which corresponds to the amino acid 
residue 198 of the entire mature cpm protein; an endog- 
enous stop site provided by the membrane anchor at amino 
acid residue number 112; a Nhe I restriction site, a LacZ 
promoter sequence; and a 3' EcoR I restriction site. The 
recombined PCR modified cpm membrane anchor domain 
DNA fragment was then restriction digested with Spe I and 
EcoR I to produce a DNA fragment for directional ligation 
into a pComb phagemid expression vector prepared in 
Example la(iv) and to form a pCombm phagemid expres- 
sion vector as described in Example lb(ii). 

h. Isolation of Anti-NPN Coding V„ DNA Segment 

To prepare modified Fd fragments for recombination with 
the PCR modified cpVm membrane anchor domain frag- 
ment to form a Fd-cpVIH DNA fusion product PCR ampli- 
fication as described above was performed using Clone 2b. 
prepared in Example 2f. as a template. The primers. Hc3 
(SEQ. ID. NO. 105) and AK 7 (SEQ. ID. NO. 106), the 
sequences of which are listed in Table 7, were used in PCR 
- to amplify the Fd portion of the Clone 2b and incorporate 
Xho I and Spe I cloning sites along with a cpVTH overlap- 
ping sequence. Hie amplified PCR modified Fd product was 
purified, electrophoresed and isolated from 1% agarose gels 
as described above. The size of the Fd fragment was 680 
base pairs. 

i. Preparation of a DNA Segment Encoding a Portion of the 
Fusion Protein Fd-cpVm 
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The purified PCR modified Fd DNA fragment containing 
cpVm overlapping nucleotide sequences prepared above 
was then admixed with the PCR modified cpVTH membrane 
anchor domain fragment to form an admixture. The frag- 

5 ments in the admixture were allowed to recombine at their 
complementary regions. The admixture containing the 
recombined PCR fragments was then subjected to a second 
round of PCR amplification as described above using the 
end primer pair AK 6 (SEQ. ID. NO. 104) and Hc3 (SEQ. 

10 ED. NO. 105) (Table 7). The corresponding product of the 
PCR amplification was purified and electrophoresed on 
agarose gels as described above. The PCR product was 
determined to be approximately 830 base pairs (Fd=680+ 
150) coiifirrning the fusion of Fd with cpVm. The sequence 

15 of the PCR product linking the Fd sequence with the cpVUI 
sequence in frame in a 5* to 3* direction is listed as SEQ. ID. 
NO. 115. The Fd-cpVUI fusion product was then used in 
directional ligations described in Example 2j for the con- 
struction of a pCBAK8-2b dicistronic phagemid expression 

20 vector. 

j. Construction of pCBAK8-2b Dicistronic Expression Vec- 
tor 

To construct a phagemid vector for the coordinate expres- 
sion of a Fd-cpvm fusion protein with kappa light chain, the 

25 PCR amplified Fd-cpVUI fusion product prepared in above 
in Example 2i was first ligated into Clone 2b phagemid 
expression vector isolated from the NPN combinatorial 
library prepared in Example 2f. For the ligation, the 
Fd-cpvm PCR fusion product was first restriction digested 

30 with Xho I and EcoR L Clone 2b phagemid vector was 
similarly digested resulting in the removal of the cloning and 
decapeptide regions. The digested Fd-cpVm fragment was 
admixed and ligated into the digested Clone 2b at the 
cohesive termini generated by Xho I and EcoR I restriction 

35 digestion. The ligation resulted in operatively linking the 
nucleotide 'residue sequence encoding the Fd-cpvm 
polypeptide fusion protein to a second cassette having the 
nucleotide residue sequences encoding the ribosome binding 
site, a pelB leader sequence and the kappa light chain 

40 already present in Clone 2b to form a dicistronic DNA 
molecule in the original Clone 2b phagemid expression 
vector. 

E. cott* strain TGI. was then transformed with the 
phagemid containing the dicistronic DNA molecule and 
45 transformants were selected on ampicillin as the original 
Clone 2b contained an ampicillin selectable resistance 
marker gene. For high efficiency electro-transformation of 
E. coli, a 1:100 volume of an overnight culture of TGI cells 
was inoculated into one liter of L-broth (1% Bacto tryptone, 
50 0.5% Bacto yeast extract. 0.5% NaCl). The cell suspension 
was maintained at 37° C. with vigorous shaking to a 
absorbance at 600 nm of 0.5 to 1.0. The cell suspension in 
log phase growth was then harvested by first chilling the 
flask on ice for 15 to 30 minutes followed by centrifugation 
55 in a cold rotor at 4000 g for 15 minutes to pellet the bacteria. 
The resultant supernatant was removed and the bacterial cell 
pellet was resuspended in a total of one liter of cold water 
to form a cell suspension. The centrifugation and resuspen- 
sion procedure was repeated two more times and after the 
60 final centrifugation. the cell pellet was resuspended in 20 ml 
of cold 10% glycerol. The resuspended cell suspension was 
then centrifuged to form a cell pellet. The resultant cell pellet 
was resuspended to a final volume of 2 to 3 ml in cold 10% 
glycerol resulting in a cell concentration of 1 to 3xl0 10 
65 cells/ml For the electro-transformation procedure, 40 ul of 
the prepared cell suspension was admixed with 1 to 2 ul of 
phagemid DNA to form a cell-phagemid DNA admixture. 
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The resultant admixture was mixed and allowed to sit on ice A phagemid expression vector, pCBAK3-2b. for the 
for one minute An eiectroporation apparatus, for example a expression of a fusion protein and kappa light chain was then 
Gene Pulsar, was set a 25 uF and 2.5 kV. The pulse controller constructed. Briefly, the pCBAK3 phagemid expression 
was set to 200 ohms. The cell-DNA admixture was trans- vector prepared above was first digested with Xho I and Spe 
ferred to a cold 0.2 cm eiectroporation cuvette. The cuvette 5 I to form a linearized pCB AK3 phagemid expression vector, 
was then placed in the chilled safety chamber and pulsed PGR amplified and modified Fd fragment prepared in 
once at the above settings. To the pulsed admixture. 1 ml of Example 2h containing Xho I and Spe I sites, was subse- 
SOC medium was then admixed and the cells were resus- quently restriction digested with Xho I and Spe L The 
pended with a Pasteur pipette (SOC medium was prepared resultant Fd fragment was then directionally Hgated via 
by admixing 2% Bacto tryptone. 0.5% Bacto yeast extract. 10 cohesive termini into the Xho I and Spe I restriction digested 
10 mM NaCl. 2.5 mM KC1. 10 mM Mgd 2 , 10 mM MgS0 4 . pCBAK3 phagemid expression vector to form a second 
and 20 mM glucose). The cells suspension was then trans- phagemid expression vector in which the PGR modified Fd 
ferred to a 17x100 mm polypropylene tube and maintained fragment was operatively linked in-frame to nucleotide 
at 37° C for one hour. After the maintenance period, the residue sequences encoding cpHL £. coU strain XLl-Blue 
transformed TGI cells were then plated on ampicillin LB 15 (Stratagene) was then transformed with the above phagemid 
plates for selection of ampicillin resistant colonies contain- vector containing Fd-cpffl. Transformants containing the 
ing the phagemid which provided the selectable marker Fd-cpffl encoding phagemid were selected on chloram- 
eene phcnicoL Phagemid DNA was isolated from chloramphem- 

Ampicillin resistant colonies were selected and analy2ed col resistant clones and was restriction digested with Sac I 
for the correct insert size and expression of Fab. Briefly. 20 and Xba I to form a unearized phagemid expression vector 
DNA minipreps of selected colonies were prepared for the into which a Sac I and Xba I light chain fragment prepared 
isolation of phagemid DNA. The isolated phagemid DNA below was directionally ligated. 

from each ininiprep was restriction digested with Xho I and Phagemid Clone 2b. isolated from the original combina- 
EcoR I and the digests were electrophoresed on a \% torial library as described in Example 2a. was restriction 
agarose gel. Clone AK16 was selected as an 830 bp fragment 25 digested with Sac I and Xba* I to isolate the nucleotide 
was visualized on the gels confirming the insertion of the residue sequence encoding the kappa light chain. The iso- 
Fd-cpVm PCR fusion product into digested Clone 2b. lated kappa light chain sequence was then directionally 

Gone AK16 phagemid was then restriction digested with ligated into the Sac I and Xba I restriction digested 
Xho I and Xba I and the nucleotide residue sequence of the phagemid expression vector prepared above containing 
dicistronic DNA molecule encoding the Fd-cpVUI fusion 30 Fd-cpffl to form the phagemid expression vector. pCBAK3- 
protein the ribosome binding site and pelB leader sequence 2b. The resultant vector contained the nucleotide residue 
for expression of the ught chain, a spacer region and the 2b sequence of a dicistronic DNA molecule for the coordinate 
kappa Ught chain was isolated by agarose gel electrophore- expression of a Fd-cpffl fusion protein with kappa light 
sis The isolated dicistronic DNA fragment was then ligated chain. The resultant phagemid expression vector consisted 
into a Xho I and Xba I restriction digested pCBAKO 35 of nucleotide residue sequences encoding the following 
expression vector prepared in Example lc(ii) to form a elements: fl filamentous phage origin of replication; a 
dicistronic phagemid expression vector designated chloramphenicol acetyl transferase selectable resistance 
pCBAK8-2b. marker gene; an inducible LacZ promoter upstream from the 

The resultant pCBAK8-2b expression vector consisted of LacZ gene; a multiple cloning site flanked by T3 and T7 
nucleotide residue sequences encoding the following ele- 40 polymerase promoters; and the dicistronic molecule (a first 
ments* f 1 filamentous phage origin of replication; a chloram- cassette consisting of a first ribosome binding site and pelB 
phenicol acetyl transferase selectable resistance marker leader operatively linked to Fd-cpffl operatively linked to a 
gene- an inducible LacZ promoter upstream from the LacZ second cassette consisting of a second LacZ. a second 
gene! a multiple cloning site flanked by T3 and T7 poly- ribosome binding site, and a second pelB leader operatively 
merase promoters; and the dicistronic DNA molecule (a first 45 linked to a kappa Ught chain). 

cassette consisting of a ribosome binding site, a pelB leader, XLl-Blue cells were then transformed with the phagemid 
and a Fd-cpVUJ DNA fusion product operatively linked to expression vector pCBAK3-2b. Transformed colonies con- 
a second cassette consisting of a second ribosome binding taining the chloramphenicol resistant phagemids were 
site, a second pelb leader, and a kappa light chain). selected as described above and analyzed for the correct sue 

k. Construction of pCBAK3-2b Dicistronic Expression Vec- 50 insert and expression of Fab as described in Example 2j. 
tor Following verification of the insert and expression of Fab in 

To construct a phagemid vector for the coordinate exprcs- the pCBAK3-2b phagemid vector. XLl-Blue cells were then 
sion of a Fd-cpffl fusion protein with kappa light chain, the transformed and induced for the expression of Fab antibod- 
PCR amplified and recombined cpm membrane anchor ies as described in Examples 3 and 4. 
prepared in Example 2g having a S Spe I and 3' EcoR I 55 3 & ion of ^.mpn Heterodimer on Phage 
restriction site was first directionally ligated into a pComo r Surfaces 

phagemid expression vector prepared in Example la(iv) to 

form a pComblH phagemid vector. See Example lb(ii) for For expression of antibody Fab directed against NPN on 
details of vector construction. The resultant pCombin phage surfaces, XLl-Blue cells were separately transformed 
phagemid vector was then restriction digested with Sac H 60 with the phagemid vectors. pCBAK8-2b and pCBAK3-2b. 
and Apa I to form an isolated fragment The resultant prepared in Examples 2j and 2k. respectively. The transfer- 
isolated fragment containing the expression control mants were selected on LB plates containing 30 ug/rol 
sequences and the cpffl sequence was then directionally cWc*amphenicoL Antibiotic resistant colonies were selected 
Ugated into a similarly digested pCBAKO phagemid vector for each phagemid transformation and grown in liquid 
prepared in Example lc(ii) to form a pCBAiG phagemid 65 cultures at 37° C in super broth (super broth was prepared 
expression vector. This vector lacked Fd and kappa Ught by admixing the following: 20 g 3 (N-MorpholinoJ propane- 
chain sequences. sulfonic acid (MOPS); 60 g tryptone; 40 g yeast extract; and 
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2 liter of water; adjust pH to 7.0 with 10 m NaOH) 
containing 30 ug/mi chloramphenicol and 12.5 ug/ml tetra- 
cycline for the respective antibiotic selection of the 
phagemid and the F episome. The antibiotic resistant trans- 
formed XLl-Blue cells were diluted to an optical density 
(OD^ am) of 0.4 in super broth. The inducer, isopropyi 
thiogalactopyranoside (IPTG). was admixed to the bacterial 
suspension for a final concentration of 1 mM and the 
admixture was maintained at 37° C. for 1 hour to induce the 
expression of the fusion protein and kappa light chain from 
the LacZ promoter. Helper phage, either R408 or VCS M13 
(Stratagene). was then admixed to the induced bacterial 
suspension at a ratio of 10-20 helper phage to 1 transformed 
bacterial cell to initiate the generation of copies of the sense 
strand of the phagemid DNA. The admixture containing the 
helper phage was then maintained for an additional two 
hours at 37° C. to allow for filamentous bacteriophage 
assembly wherein the expressed anti-NPN Fab antibodies 
fused to either bacteriophage membrane anchor domains of 
cpvm or cpm were incorporated into surface of the bac- 
teriophage particles. The bacterial suspension was then 
centrifuged resulting in a bacterial cell pellet and a super- 
natant containing phage. The supernatant was removed, 
collected and assayed as described below for the presence of 
functional anti-NPN Fab molecules anchored to the phage 
particles by either cpVm or cpDL 

4. Assays for Verifying the Presence and Function 
of Anti-NPN Heterodimer on the Surface of 
Filamentous Phage 

a. Electron Microscopy 

To localize functional Fab molecules, the binding to 
antigen labelled with colloidal gold was studied Phage 
containing supernatants and bacterial cells prepared in 
Example 3 were spotted on formvar Polysciences, Inc., 
Warrington. Pa.) coated grids affixed onto a solid phase. In 
some experiments grids were coated with cells and infected 
with phage in situ. Subsequently grids were blocked with 
bovine serum albumin (BSA) 1% in PBS at pH 7.2, washed 
and incubated with 2-7 nanometer (nra) colloidal gold 
particles coated with BSA-NPN hapten conjugate for a time 
period sufficient to form a labeled immunoreaction complex. 
The grids were washed to remove excess gold particles and 
negatively stained in uranylacetate and visualized by elec- 
tron microscopy. 

Examination of filamentous phage and permeabilized 
cells producing phage revealed specific labelling of phage or 
exposed bacterial membranes. Phage were observed to con- 
tain 1 to 24 copies of antigen binding sites per particle. 
Neither helper phage alone nor intact £. coli labelled with 
antigen. Background nonspecific binding was very low. 
Filamentous phage particles emerging from the E. coli 
surfaces were labelled with antigen as shown in FIG. 9. 

The generation of a related phage surface expression 
vector utilizing cpm as a fusion partner with Gone 2b, 
pCBAK3-2b, revealed specific antigen labelling to the 
phage head but not the column. Additionally human anti- 
tetanus Fab expressed as a cpin fusion did not bind to 
BSA-NPN antigen, 

b. Phase Elisa 

Microtitration plates were coated with NPN-BSA conju- 
gate (0.1 ml. 1 ugtal in 0. 1M Tris-HCl pH 9.2). and blocked 
with 1% BSA in PBS. Serial two fold dilutions of pCBAKS- 
2b derived phage (0.1 ml), prepared in Example 3. were 
added to the pre-coated microtitration plate and incubated 
for 3 hours at ambient temperature or 16 hours at 4° C. The 
plates were washed with PBS and goat anti-kappa alkaline 
phosphatase conjugate (Fisher Biotech. Pittsburgh. Pa.) 
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added (0-1 ml diluted 1/1000 in PBS containing 0.1% BSA) 
and incubated for 2 hours at room temperature. The plates 
were washed in PBS and substrate added (0. 1 ml, 1 mg/ml 
p-nitrophenylphosphate in 0.1M Tris-HCl. pH 9.5. contain- 

5 ing 50 mM MgQ 2 ). After incubation at 37° C. for signal 
development, the optical densities at 400 nm were deter- 
mined. Competition assays were performed with the addi- 
tion of increasing amounts of free NPN hapten ranging from 
zero up to 5 mgAvell. 

10 The ELISA assays confirmed the presence of functional 
antibody Fab. In a two site ELISA on NPN antigen coated 
plates when probed with anti-mouse kappa chain enzyme 
conjugate, phage supernatant generated from helper phage 
infection of cells carrying the pCBAKS-2b construct exhib- 

15 ited expected titration curves with serial two fold dilutions 
of phage containing antibody. The results of the two-site 
EUSA are shown in FIG. 10. For a signal to be generated in 
this assay, the phage particle must (i) have functionally 
associated Fd and kappa chains and (ii) be multivalent 

20 Specificity of the particle was assessed by inhibiting binding 
to the plate in the presence of increasing concentrations free 
hapten. The generated phage particles exhibited binding to 
solid phase of the ELISA and could be inhibited by addition 
of hapten as shown in FIG. 11. Complete inhibition was 

25 achieved when 5 ng of free NPN hapten was used in the 
assay. Helper phage did not give a signal in the ELBA, 
c. Antigen Specific Precipitation of Phage 

Phage supernatant from XLl-Blue was transformed with 
the pCBAK8-2b dicistronic expression vector prepared in 

30 Example 3 (1 ml) was incubated with BSA-NPN conjugate 
( 10 pL 2 mg/ml) for 18 hours at 4° C. The mixture was then 
pelleted by centrifugarion at 3000 rpm on a bench top 
centrifuge and the appearance of precipitate noted. Helper 
phage was used as a control. The pellet was washed repeat- 

35 edly in cold PBS (5x3 ml/wash) and then resuspended in LB 
(0.5 ml). The solubilized precipitates were added to fresh 
XLl-Blue cells (0.5 ml of overnight culture), incubated for 
1 hour at 37° C. and aliquots plated out on LB agar 
containing chloramphenicol (30 jig/ml). Colonies were 

40 selected randomly. Colony lifts on nitrocellulose were 
treated with lysozyme to digest the cell wall, briefly treated 
with chloroform to breakdown the outer membrane, blocked 
in BSA 1% in PBS and incubated with 125 I labelled BSA- 
NPN antigen. After several washes in PBS (containing 

45 0.05% Tween-20). film was exposed to the washed and dried 
filter overnight at -70° C. and the autoradiographs were then 
developed. 

Precipitates were obtained with antibody containing 
phage but not helper phage in the presence of BSA-NPN. In 
50 addition, the particles retained infectiviry on subsequent 
incubation with bacterial cells carrying the F episome and 
generated 4x10* colonies from a single solubilized precipi- 
tate. 

Additionally. DNA restriction analysis was carried out to 
55 determine the presence of heavy and light chain inserts. 
DNA restriction analysis of the clones revealed the presence 
of a. Xho and Xba I fragment of 1.4 kb as expected for 
Fd-cpVTQ fusion construct and kappa chain insert 
These results give additional evidence for antigen speci- 
60 ficity and multivalency. In addition to providing immuno- 
logical parameters, this precipitation offers possibilities for 
facile enrichment of antigen specific phage particles. In 
principle, phage containing specific antibodies can be highly 
enriched by precipitation with antigens (which may be cell 
65 surface markers, viral, bacterial as well as synthetic 
molecules). The washed antigen-antibody precipitates can 
be solubilized by the addition of excess antigen and viable 
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phage recovered. For the recovery of rare species an immo- 
bilized antigen may be used which opens the way for 
differential affinity elution. 

In order to demonstrate the utility of immobilized antigen 
for the enrichment of clones of defined binding specificity, 
a panning experiment was performed. An ampicillin resis- 
tant phagemid expressing an anti- tetanus Fab as a cpVIH 
fusion was constructed. Rescue of this clone with helper 
phage produced phage encoding the ampicillin resistant 
phagemid which displayed the anti-tetanus Fab on their coat. 
These phage encoding tetanus specificity were admixed with 
NFN hapten encoding phage (1:100) and allowed to bind to 
a microtitration plate coated with tetanus toxoid Following 
a one hour maintenance period, the plate was washed 
extensively and phage were then eluted with a low pH 
buffer. Infection of XL 1 -Blue cells in log phase growth and 
subsequent plating of aliquots on ampicillin and chloram- 
phenicol allowed for direct quantitation of enrichment 
Examination of over 1.000 colonies showed that ampicillin 
resistant colonies derived from the eluted phage exceeded 
chloramphenicol resistant colonies by 27 to 1. Therefore, 
panning enriched the phage displaying the anti-tetanus Fab 
by 2700 fold. This result suggests that a clone of defined 
specificity present at one part per million will dominate over 
nonspecific clones following two rounds of panning. 

5. Advantages of Assembling Combinatorial 
Antibody Fab Libraries Along Phage Surfaces 

A powerful technique for generating and selecting com- 
binatorial Fabs. with 10 8-5 * members, is presented. In the 
vector described herein, the restriction cloning sites for 
inserting PCR generated antibody fragments have been 
retained as previously reported for the lambda vector. The 
rescue of the genes encoding the antibody Fd and kappa 
chains is mediated through the utilization of the f 1 origin of 
replication leading to the synthesis and packaging of the 
positive strand of the vector on co-infection with helper 
phage. Since the 'mature' virus particle assembles by incor- 
porating the major coat protein around the single stranded 
DNA as it passes through the inner membrane into the 
periplasmic space, not only does it capture the genetic 
information carried on the phagemid vector but also incor- 
porates several copies of functional Fab along the length of 
the particle. On subsequent infection of hosts cells carrying 
the F episome the phagemid confers resistance allowing 
selection of colonies on the appropriate antibiotic In 
essence, the antigen recognition unit has been linked to 
instructions for its production. 

The full power of the earlier combinatorial system could 
not be fully utilized since screening allowed ready access to 
only about 0.1-1% of the members. In the phagcmid/M13 
system similar size libraries arc generated and all the mem- 
bers are accessed via affinity selection. Furthermore, unlike 
the lambda vector which generated monovalent Fabs, this 
system generates multivalent particles, thus allowing the 
capture of a wider range of affinities. 

The unique phagemid restriction sites permit the recom- 
bination of Fd and kappa chains allowing chain replacement 
or sh ufflin g The rescue of filamentous single stranded DNA 
allows rapid sequencing and analysis of the genetic make up 
of the clone of interest. Indeed it can be envisaged that phage 
encoding antibody specificity may be enriched by antigen 
selection prior to DNA sequencing or mutagenesis. The 
option to further develop an iterative process of mutation 
followed by selection may allow the rapid generation of high 
affinity antibodies from germ line sequences. The process 
may be automated Setting aside the potential of the system 
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to mimic nature, the phagemid/M13 system would allow a 
more complete dissection of the antibody response in 
humans which may yield useful therapeutic and diagnostic 
reagents. 

5 The membrane anchoring of the heavy chain and the 
compartmentalization of the kappa chain in the periplasm is 
the key to expressing this functional dim eric protein. The 
potential of this system is by no means limited to antibodies 
and may be extended to any protein recognition system or 

10 combination of systems containing multiple members. For 
example coupling of ligand and effector systems in a high 
avidity matrix is now possible. In a similar vein a library of 
ligands can be sorted against a library of receptors. 

15 6. Randomized Mutagenesis of the CDR3 Region 
of a Heavy Chain Encoding Tetanus Toxoid 
a. PCR Mutagenesis with Degenerate . Oligonucleotides 

To obtain a mutagenized heterodimer of this invention of 
altered specificity that would no longer recognize TT but 

20 would recognize and specifically bind to a new antigen, a 
method was developed to randomize only the CDR3 region 
of a heavy chain fragment encoded by a known nucleotide 
sequence. This approach is schematically diagrammed in 
FIG. 12 where a representative heavy chain fragment within 

25 a phagemid clone, consisting of alternating framework 
regions (1 through 4) shown by white blocks and comple- 
mentarity detennining regions (CDR) (1 through 3) shown 
by cross-hatched blocks and the first constant region (CHI), 
is subjected to two separate rounds of PCR- In the first PCR 

30 amplification reaction, the 5* end of the heavy chain begin- 
ning at framework 1 and extending to the 3' end of frame - 
r work 3 is amplified. In the second PCR amplification 
reaction, the CDR3 region is randomly mutagenized shown 
by the black box. This is accomplished through the use of a 

35 pool of oligonucleotide primers synthesized with a degen- 
erate region sandwiched between and contiguous with con- 
served framework 3 and 4 region sequences. The resulting 
amplification products, each having a randomized CDR3 
region, begin at the 3 ■ end of framework 3 and extend to the 

40 3* end of the CHI region. The pool of degenerate oligo- 
nucleotide primers have been designed to result in the 
amplification of products having a 5' end that is comple- 
mentary to and will overlap with the 3' end of the products 
of the first PCR reaction product Thus, the two separate 

45 PCR reaction products are pooled and subjected to a third 
PCR reaction in which the overlapping region between the 
two products is extended to result in heavy chain having a 
randomized CDR3 region. 

A heavy chain DNA template for use in this invention was 

50 available in a clone (a phagemid vector containing heavy 
and light chain fragments) from a human combinatorial 
anti-tetanus toxoid (TT) Fab library. This library was con- 
structed in the pCBAK-3 dicistronic expression vector for 
the expression of a heavy chain-cpin fusion protein (Fd- 

55 cpHI) and a soluble ligfrt chain as described for anti-NPN in 
Example 2k and by Persson et aL. Proc. Natl. Acad, Sci„ 
USA, 88:2432-2436 (1992) and Barbas et aL, Proc. Natl 
Acad. ScL, USA 88:7978-7982 (1992). A clone, hereinafter 
referred to as pCE-TT7E, was expressed as described for 

60 anti-NPN heterodirners on phage surfaces in Example 3 and 
subsequently screened by, panning on TT-coated plates as 
described for anti-NPN in Example 4c . Clone pCE-TT7E 
exhibited a towards TT on the order of 1(T 7 M and was 
enriched over nonspecific phage by ltf-fold as described by 

65 Barbas et aL. supra. Clone pCE-TT7E. having both heavy 
and light chain sequences, was used as the template DNA for 
the randomized mutagenesis of the CDR3 region of the 
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heavy chain to alter antigen binding specificity as described One hundred nanograms of gel purified products from the 
herein. The sequence of the heavy chain was determined as first and second PCR reactions were then admixed with 1 ug 
described in Example la(ii). Two separate PCR reactions each of FTX3 and CGiZ oligonucleotide primers as a 
were performed as illustrated in FIG. 12. primer pair in a final PCR reaction to form a complete heavy 

The first PCR reaction resulted in the amplification of the 5 caa in fragment by overlap extension as illustrated in FIG. 
region of the heavy chain fragment in the pC3-TT7E clone 12 . The PCR reaction admixture also contained 10 ul 
beginning of framework region 1 and extending to the end lOxPCR buffer. 1 ul Taq polymerase and 8 ui 2 J mM d 
of framework region 3 which is located 5' to CDR3 which ^yp* s as described above. The PCR reaction was performed 
is approximately 400 base pairs in length. To amplify this - as described above. To obtain sufficient quantities of ampli- 
region. the following primer pairs were used. The 5* anti- 10 fication product. 15 identical PCR reactions were performed, 
sense oligonucleotide primer. FT3X. having the nucleotide ^suiting heavy chain fragments beginning at frame- 

sequence 5'-G-CAA-TAA-ACC-CTC-ACT-AAA-GGG-3' wor fc i and extending to the end of CHI and having 
(SEQ ID NO 118). hybridized to the non-coding strand of randomly mutagenized CDR3 regions were approximately 
the heavy chain corresponding to the region 5 of and 79Q base pairs in length. The heavy chain fragment ampli- 
including the beginning of framework 1. The 3' sense 13 fj cat i on products from the 15 reactions were first pooled and 
oligonucleotide primer. B7EFR3. having the nucleotide tnen ge j purified as described above prior jo their incorpo- 
sequence 5'-TCT-CGC-ACA-ATA-ATA-CAC-GGC-3' rat i on i^o a phagemid library. 
(SEQ ID NO 119), hybridized to the coding strand of the b Phagemid Library Construction 

heavy chain corresponding to the 3* end of the framework -3 The resultant gel purified heavy chain fragments prepared 
region. The oligonucleotide primers were synthesized by ^ in Example 6a were then digested with the restriction 
Research Generics (Hunsrville. Ala.). The PCR reaction was enzymes. Xho I and Spe L as described in Example 2d. The 
performed in a 100 ul reaction containing one ug of each of resultant digested heavy chain fragments were subsequently 
oligonucleotide primers FTX3 and B7EFR3. 8 ul 25 mM gcl purified prior to insertion into the pC3 TT7E phagemid 
dNTP's (dXTR dCTP. dGTP. dTTP). 1 ul Taq polymerase. vector clone which was previously digested with the same 
10 ng of template pCE-TT7E. and 10 ul of lOxPCR buffer 25 restriction enzymes to remove the non-mutagenized heavy 
purchased commercially (Promega Biotech). Two drops of cnam fragment and form a linear vector. Ligation of 640 ng 
mineral oil were placed on top of the admixture and 35 of thc ne avy chain Xho I/Spe I fragments having 
rounds of PCR amplification in a thermocycler were per- mutagenized CDR3 regions into two ug of the linearized 
formed. The amplification cycle consisted of denaturing at pC3-TT7E phagemid vector to form circularized vectors 
94 C. for one minute, annealing at 50° C. for one minute. ^ na vi n g mutagenized CDR3 regions was performed over- 
followed by extension at 72° C. for two minutes. The ^glit at room temperature using 10 units of BRL ligase 
resultant PCR amplification products were then gel purified (Gaithersburg. Md.) in BRL ligase buffer in a reaction 
as described in Example Id and used in an overlap extension volume of 150 uL Five separate ligation reactions were 
PCR reaction with the products of the second PCR reaction, performed to increase the size of the phage library having 
both as described below, to recombine the two products into 35 mu tagenized CDR3 regions. Following the ligation 
reconstructed heavy chains containing mutagenized CDR3 reactions, the circularized DNA was precipitated at -20° G. 
regions as illustrated in FIG. 12- - for two hours by the admixture of 2 ul of 20 mg/ml 

The second PCR reaction resulted in the amplification of glycogen. 15 ul of 3M sodium acetate at pH 5.2 and 300 ul 
the heavy chain from the 3' end of framework region 3 Q f et hanol. DNA was then pelleted by microcentrilugation at 
extending to the end of CHI region which is approximately w 40 Q t f or 15 minutes. The DNA pellet was washed with cold 
390 base pairs in length. To aniplify this region, the follow- 7 q<& ethanol and dried under vacuum. The pellet was 
ing primer pairs were used. The 5* anti-sense oligonucleotide resuspended in 10 ul of water and transformed by eiectropo- 
primer pool, designated 7ECDR3, had the nucleotide ration * mio 300 ul of £. coU XLl-Blue cells as described in 
sequence represented by thc formula. Example 2k to form a phage library. Thc total yield from the 

5'-CTC-tax-Tat-TCT^ 45 mutagenesis and transformation procedure described herein 

hTNS-NNS-^s-NNS-NNS-NNS-NNS-NNS-NNS-NNS-NNS- was approximately 5xl0 7 transformants. 

TGG-OGC-CAA-GGG-A CC-ACG-3' After transformation, to isolate phage on which het- 

where N can be A. C G. or T and S is either C or G (SEQ erodimer expression has been induced for subsequent pan- 
ID NO 120) wherein the 5' end of the primer pool is ning on target antigens such as fluorescein. 3 ml of SOC 
complementary to the 3' end of framework 3 represented by 50 medium (SOC was prepared by admixture of 20 g bacto- 
the complemeniary nucleotide sequence of the oUgonucle- tryptone. 5 g yeast extract and 0.5 g Nad xn one liter of 
otide primer B73FR3 and the 3' end of the primer pool is water, adjusting the pH to 7.5 and admixing 20 ml of glucose 
complementary to the 5' end of framework 4. The region just before use to induce the expression of the Fd-cpIH and 
between the two specified ends of the primer pool is repre- light chain heterodimer) was admixed and the culture was 
sented by a 48-mer NNS degeneracy which ultimately 55 shaken at 220 rpm for one hour at 37 G. after which 10 ml 
encodes a diverse population of mutagenized CDR3 regions of SB (SB was prepared by admixing 30 g tryptone, 20 g 
of 16 amino acid residues in length. The 3* sense oligonucle- yeast extract, and 10 g Mops buffer per liter with pH 
otide primer. CGIZ. as described by Persson et aL. supra, adjusted to 7) containing 20 ug/ml carbenicillin and 10 
having the nucleotide sequence ug/ml tetracycline and the admixture was shaken at 300 rpm 
5*-GCATGTACTAGTTTTGTCACAAGATTTGGG-3' 60 for an additional hour. This resultant admixture was admixed 
(SEQ ID NO 121). hybridized to the coding strand of the to 100 ml SB containing 50 ug/ml carbenicillin and 10 ug/ml 
heavy chain corresponding to the 3' end of the CHI. The tetracycline and shaken for one hour, after which helper 
second PCR reaction was performed on the pC3-TT7E in a phage VCSM13 (10 12 pfu) were admixed and me admixture 
100 ul reaction as described above containing one ug of each was shaken for an additional two hours. After this time. 70 
of oligonucleotide primers 7ECDR3 and CG 12. Thc result- 65 ug/ml kanamycin was admixed and maintained at 30° G 
ant PCR amplification product was then gel purified as overnight. The lower temperature resulted in better net- 
described above. erodimer incorporation on the surface of the phage. The 
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supernatant was cleared by centrifugation (4000 rpm for 15 
minutes in a JAlOrotor at 4° C). Phage were precipitated by 
admixture of 4% (w/v) polyethylene glycol 8000 and 3% 
(w/v) NaCl and maintained on ice for 30 minutes, followed 
by centrifugation (9000 rpm for 20 minutes in a JA10 rotor 
at 4° C). Phage pellets were resuspended in 2 ml of PBS and 
nucrocentrifuged for three minutes to pellet debris, trans- 
ferred to fresh tubes and stored at -20° C. for subsequent 
screening as described below. 

For deterrnining the titering colony forming units (cfu). 
phage (packaged phagemid) were diluted in SB and 1 ul was 
used to infect 50 ul of fresh (AOD600=1) E. coli XLI-Blue 
cells grown in SB containing 10 ug/ml tetracycline. Phage 
and cells were maintained at room temperature for 15 
minutes and then directly plated on LB/carbenicillin plates, 
c. Selection of Anti-Flu orescein Heterodimers on Phage 
Surfaces 

1) Multiple Pannings of the Phage Library Having 
Mutagenized CDR3 Regions 

The phage library produced in Example 6b having heavy 
chain fragments with mutagenized CDR3 regions was 
panned as described herein on a microtitcr plate coated with 
a 50 ug/ml fluorescein-BSA conjugate to screen for anti- 
fluorescein heterodimers. Fluorescein was conjugated to 
BSA according to the methods described in "Antibodies: A 
Laboratory Manual", eds Harlow et al.. Cold Spring Harbor 
Laboratory, 1988. 

The panning procedure described was a modification of 
that originally described by Parmley and Smith (Parmley et 
al.. Gene, 73:30-5-318). Two to four. wells of a microliter 
plate (Costar 3690) were coated overnight at 4° C. with 25 
ul of 50 ug/ml antigen prepared above in 0.1M bicarbonate. 
pH 8.6. The wells were washed twice with water and 
blocked by completely filling the well with 3% (w/v) bovine 
serum albumin (BSA) in PBS and incubating Che plate at 37° 
C. for 1 hour. Blocking solution was shaken out, 50 pi of the 
phage library prepared above (typically 10 u cfu) was added 
to each well, and the plate was incubated for 2 hours at 37° 
C. 

Phage were removed and the plate was washed once with 
water. Each well was then washed 10 times withTBS/Tween 
(50 mM Tris-HCl. pH 7.5, 150 mM Nad. 0.5% Tween 20) 
over a period of 1 hour at room temperature — pipetted up 
and down to wash the well, each time allowing the well to 
remain completely ailed with TBSTTween between wash- 
ings. The plate was washed once more with distilled water 
and adherent phage were eluted by the addition of 50 ul of 
elution buffer (0.1M HQ adjusted to pH 22 with solid 
glycine, containing 1 mg/ml BSA) to each well and incu- 
bation at room temperature for 10 minutes. The elution 
buffer was pipetted up and down several times, removed, 
and neutralized with 3 ul of 2M Tris base per 50 ul of elution 
buffer used. Eluted phage were used to infect 2 ml of fresh 
(ODcoo^ 1) £ coli XU-Blue cells for 15 minutes at room 
temperature, after which 10 ml of SB containing 20 ug/ml 
carbenicillin and 10 ug/ml tetracycline was admixed. 
[Aliquot (20. 10. and Vio ul) were removed for plating to 
determine the number of phage (packaged phagemids) that 
were eluted from the plate.) The culture was shaken for 1 
hour at 37° C, after which it was added to 100 ml of SB 
containing 50 ug/ml carbenicillin and 10 ug/ml tetracycline 
and shaken for 1 hour. Then helper phage VCSM13 (10 12 
pfu) were added and the culture was shaken for an additional 
2 hours. After this time, 70 ug/ml kanamycin was added and 
the culture was incubated at 37° C. overnight- Phage prepa- 
ration and further panning were repeated as described above. 
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Following each round of panning, the percentage yield of 
phage must be determined, where % yield — (number of 
phage eluted/number of phage applied) xlOO. 

As an alternative to elution with acid, phage bound to the 

5 wells of the microtiter plate were eluted by admixing 50 ul 
of a solution of 10~ 5 M fluorescein diluted in PBS followed 
by a maintenance period of one hour at 37 C. The solution 
was then pipetted up and down to wash the wells. The 
resultant eluate was transferred to 2 ml of fresh E. coli 

io XU-Blue cells for infection as described above for prepar- 
ing phage and further panning. In subsequent rounds of 
panning, phage were eluted with 10~*M fluorescein. 

The results of the amount of phage that were specifically 
bound to fluoresce in-coated wells over four consecutive 

15 rounds of panning and eluted with acid or with fluorescein 
alone are shown below in Table 8. Comparable yields of 
phage on which heterodimers were expressed that bound 
specifically to fluorescein were achieved with either elution 
protocol. Approximately 20 clones of the 5xl0 7 clones 

20 resulting from the mutagenesis and transformation exhibited 
specificity of binding towards fluoresce in-coated wells. 
These data confirm that mutagenesis of the CDR3 region as 
described in this invention resulted in the altering of a 
heterodimer which initially specifically bound to TT to one 

25 that specifically bound fluorescein. 



TABLE 8 



30 



35 





Phase Eluted 






Acid Ehfljon 


Fluorescein EIudod 


round 1 


5.6 x lOVwell 


4.7 x lOVwell 


round 2 


4.6 x lOVwell 


5.6 x lOVwell 


round 3 


3.75 x lOVwell 


1.35 x 10*/weU 


round 4 


131 x 10*/weU 


4.0 x lOVwell 



2) Preparation of Soluble Heterodimers for 
Characterizing Binding Specificity to Fluorescein 

40 In order to further characterize the specificity of the 
mutagenized heterodimers expressed on the surface of phage 
as described above, soluble heterodimers were prepared and 
analyzed in ELBA assays on fluorescein-coated plates, by 
competive ELISA with increasing concentrations of soluble 

45 fluorcscein-BSA and also by fluorescence quenching assays. 
The latter assays were performed as described in "Fluores- 
cein Hapten: An Immunological Probe", ed E. W. Voss. CRC 
Press. Inc. pp 52-54, 984. 

To prepare soluble heterodimers. phagemid DNA, from 

50 positive clones was isolated and digested with Spel and 
NheL Digestion with these enzymes produces compatible 
cohesive ends. The 4.7-kb DNA fragment lacking the gin 
portion was gel-purified (0.6% agarose) and self-ligated. 
Transformation of E. coli XLI-Blue afforded the isolation of 

55 recombinants lacking the gHI fragment Clones were exam- 
ined for removal of the gin fragment by Xhol/Xbal 
digestion, which should yield an 1.6-kb fragment Clones 
were grown in 100 ml SB containing 50 ug/ml carbenicillin 
and 20 mM Mgd 2 at 37° C until an ODgoo of 0.2 was 

60 achieved. ETG (1 mM) was added and the culture grown 
overnight at 30° C. (growth at 37° C. provides only a light 
reduction in heterodimer yield). Cells were pelleted by 
centrifugation at 4000 rpm for 15 minutes in a J A 10 rotor at 
4° C. Cells were resuspended in 4 ml PBS containing 34 

65 ug/ml pbenylmethylsulfonyl fluoride (PMSF) and lysed by 
sonication on ice (2-4 minutes at 50% duty). Debris was 
pelleted by centrifugation at 14,000 rpm in a JA20 rotor at 
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4* C. for 15 minutes. The supernatant was used directly for 
ELISA analysis and was stored at -20° C. For the study of 
a large number of clones. 10-ml cultures provided plenty of 
hcterodimer for analysis- In this case. sonicatioQS were 
performed in 2 ml of buffer. 

The soluble heterodimers prepared above were assayed by 
ELISA. For this assay. 1 ug/well of fluorescein-BSA solu- 
tion was admixed to individual wells of a microtiter plate 
and maintained at 4° C. overnight to allow the protein 
solution to adhere to the walls of the well. After the 
maintenance period, the wells were washed one time with 
PBS and thereafter maintained with a solution of 3<& BSA to 
block nonspecific sites on the wells. The plates were main- 
tained at 37° C for one hour after which time the plates were 
inverted and shaken to remove the BSA solution. Soluble 
heterodimers prepared above were then admixed to each 
well and maintained at 37° C. for one hour to form an 
immunoreaction products. Following the maintenance 
period, the wells were washed 10 times with PBS to remove 
unbound soluble antibody and then maintained with a sec- 
ondary goat anti-human FAB conjugated to alkaline phos- 
phatase diluted in PBS containing 1% BSA. The wells were 
maintained at 37° C. for one hour after which the wells were 
was 10 times with PBS followed by development with 
p-nitrophenyl phosphate. 

Immunoreactive heterodimers as determined in the above 
ELISA were then analyzed by competition ELISA to deter- 
mine the affinity of the mutagenized heterodimers. The 
ELISA was performed as described above with increasing 
concentrations of soluble fluorescein-BSA ranging in con- 30 
centration from 10 _9 M up to IT*M in concentration 
admixed in the presence of the soluble heterodimers. Maxi- 
mal inhibition of binding was achieved at a concentration of 
10~*M free antigen with a half-maximal inhibition obtained 
with, approximately 10~ 7 M free antigen. Thus, the 35 
mutagenized heterodimers of this invention specifically rec- 
ognize and bind to fluorescein. Additional experiments were 
performed to confirm that the mutagenized heterodimers no 
longer recognized the TT to which they nonmutagenized 
heterodimer originally bound. Fluorescence quenching 40 
assays were also performed to confirm the specificity of 
binding of the mutagenized heterodimers. Soluble het- 
erodimers prepared from phage that were either eluted with 
acid or with fluorescein alone were equally effective at 
binding fluorescein by any of the aforementioned 45 
approaches. The invention of mutagenesis of the CDR3 
region of the heavy chain of a hcterodimer described herein 
thus resulted in the alteration of binding specificity fromTT 
to fluorescein. 

Thus, the above example illustrates a method according to 50 
the present invention for mutagenizing the complementarity 
determining region (CDR) of an irninunogiobtilih gene, and 
also illustrates oligonucleotides useful therefor. 

In one embodiment, therefore, an oligonucleotide is con- 
templated that is useful as a primer in a polymerase chain 55 
re action (PGR) for inducing mutagenesis in a complemen- 
tarity determining region (CDR) of an imnwnoglobulin 
gene. The oligonucleotide has 3' and 5* termini and com- 
prises (1) a nucleotide sequence at its 3* terminus capable of 
hybridizing to a first framework region of an immunoglo- 60 
bulin gene. (2) a nucleotide sequence at its 5* terminus 
capable of hybridizing to a second framework region of ao 
immunoglobulin gene, and (3) a nucleotide sequence 
between the 3' and 5' termini adapted for introducing muta- 
tions during a PCR into the CDR region between the first 65 
and second framework regions of the immunoglobulin gene, 
thereby mutagenizing the CDR region. 
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Insofar as immunoglobulin genes have three CDR regions 
on both the heavy chain and the light chain of an 
immunoglobulin, each separated by a distinctive framework 
region, it is to be understood that the above example is 
5 readily applicable to introducing mutations into a specific 
CDR by selection of the above 5' and 3* nucleotide 
sequences as to hybridize to the framework regions flanking 
the targeted CDR. Thus the above first and second frame- 
work sequences can be the conserved sequences flanking 
CDR1. CDR2 or CDR3 on either the heavy or light chain. 
10 Exemplary and preferred is the CDR3 of the human immu- 
noglobulin heavy chain. 

The length of the 3' and 5' terminal nucleotide sequences 
of a subject mutagenizing oligonucleotide can vary in length 
as is well known, so long as the length provides a stretch of 
13 nucleotides complementary to the target framework 
sequences as to hybridize thereto. In the case of the 3' 
terminal nucleotide sequence, it must be of sufficient length 
and complementarity to the target framework region located 
3* to the CDR region to be mutagenized as to hybridize and 
20 provide a 3* hydroxyl terminus for initiating a primer exten- 
sion reaction. In the case of the 5* terminal nucleotide 
sequence, it must be of sufficient length and complementa- 
rity to the target framework region located 5' to the CDR 
region to be mutagenized as to provide a means for hybrid- 
izing in a PCR overlap extension reaction as described 
above to assemble the complete immunoglobulin heavy or 
light chain. 

Framework regions flanking a CDR are well characterized 
in the immunological arts, and include known nucleotide 
sequences or consensus sequences as described elsewhere 
herein. Where a single, preselected immunoglobulin gene is 
to be mutagenized. the framework-defined sequences flank- 
ing ,a particular CDR are known, or can be readily deter- 
mined by nucleotide sequencing protocols. Where a reper- 
toire of immunoglobulin genes are to be mutagenized. the 
framework-derived sequences are preferably conserved, as 
described elsewhere herein. 

Preferably, the length of the 3' and 5* terminal nucleotide 
sequences are each at least 6 nucleotides in Length, and can 
be up to 50 or more nucleotides in length, although these 
lengths are unnecessary to assure accurate and reproducible 
hybridization. Preferred are lengths in the range of 12 to 30 
nucleotides, and typically are about 18 nucleotides. 

A particularly preferred framework-defined nucleotide 
sequence for use as a 3' terminus nucleotide sequence has the 
nucleotide sequence 5*-TGGGGCCAAGGGACCACG-3* 
(SEQ ID NO 122). 

A particularly preferred framework-defined nucleotide 
sequence for use as a 5* terminus nucleotide sequence has the 
nucleotide sequence 5'-GTGTATTATTGTGCGAGA-3' 
(SEQ ID NO 123). 

The nucleotide sequence located between the 3* and 5' 
termini adapted for mutagenizing a CDR can be any nucle- 
otide sequence, insofar as the novel sequence will be incor- 
porated by the above methods. However, the present 
approach provides a means to produce a large population of 
mutagenized CDR's in a single PCR reaction by the use of 
a population of redundant sequences defining randomized or 
nearly randomized nucleotides in the CDR region to be 
mutagenized. 

A preferred oligonucleotide comprises a nucleotide 
sequence between the above described 3' and 5* termini that 
is represented by the formula: |NNS] A or |NNK]„. wherein 
N can independently be any nucleotide, where S is G or C 
K is G or T, and where n is from 3 to about 24. In preferred 
embodiments the preferred oligonucleotides have the for- 
mula: 
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5'-critjiAnxrTCjTC3COAOAfNNS ltcgggccaagggac- is subjected to PCR conditioos that include a PCR primer 
cacgo* fSEQ n> NO: 124) and 5-GTGTATTATTCrrGC- oligonucleotide as described above constituting the first 

GAGAfNNKLTGGGGCCAAGGGACCACG-3' (SEQ ID NO: , fo . n . im _ ^ 

1 primer in a PCR primer pair as is well known to produce an 

amplified PCR product that is derived from the preselected 

Exemplary and particularly preferred is the oligonucleotide 5 COR region but that includes the nucleotide sequences of 

where with the formula [NNS] n and n is 16, such that the the PCR primer. The second oligonucleotide in the PCR 

oligonucleotide represents a large population of redundant amplifying conditions can be any PCR primer derived from 

oligonucleotide sequences. the immunoglobulin gene lo be mutagenized* as described 

The invention also contemplates a mutagenesis method herein 

te al^g thetauBUDO^ 10 fe d$ oUgonudeotide of Uus 

nologlobulin gene. The method provides direct mutagenesis . . , . . 

in a^reselected CDR of an irnmunoglobulm gene which invcntl0n as desCTibed above * 

comprises subjecting a recombinant DNA molecule (rDNA) The foregoing is intended as illustrative of the present 

containing the cloned immunoglobulin gene having a target invention but not limiting. Numerous variations and modi- 

CDR to PCR conditions suitable for amplifying a prese- 15 fications can be effected without departing from the true 

lected region of the CDR. In the method, the rDNA molecule spirit and scope of the invention. 

SEQUENCE LISTING 

( 1 ) GENERAL INFORMATION: 

(lit) NUMBER OF SEQUENCES: 125 

( 2 ) INFORMATION FOR SEQ ID NO: 1: 

( i ) SEQUENCE CHARACTERISTICS: > 
( A ) LENGTH: 173 base ptirt 
{ B ) TYPE; oockic and 
( C > STRANDEDNESS: double 
( D ) TOPOLOGY: fine* 

{ i i ) MOLECULE TYPE: DNA (gcaomic) 

( i i i ) HYPOTHETICAL: NO ' 

( i t ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:l : 
CGCCOCAAAT TCTATTTCAA OOAOACAOTC ATAATOAAAT ACCTATTOCC TACOOCAOCC 60 
OCTQOATTOT TATTACTCGC TOCCCAACCA OCCATOOCCC AGOTGAAACT CCTCOAGATT 120 
TCTAOACTAO TTACCCOTAC OACOTTCCGO ACTACGOTTC TTAATAOAAT TCO 173 

( 2 ) INFORMATION FOR SEQ ID NO-J: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 173 bmc pm 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: double 
( D ) TOPOLOGY linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

( i t ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ © NO-2: 
TCGACOAATT CTATTAAOAA CCOTAOTCCG OAACOTCOTA COOCT AACTA GTCTAOAAAT 60 
CTCOAOCAOT TTCACCTOOO CCATOOCTOG TTGOOCAOCO AOTAATAACA ATCCAOCOGC 120 
TOCCGTAGGC AATAOOTATT TCATTATGAC TOTCTCCTTO AAATAOAATT TOC 173 



( 2 ) INFORMATION FOR SEQ ID NOO: 



( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 131 base pain 
( B ) TYPE: oneiric acid 
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-continued 



( C ) STRANDEDNESS: double 
( D > TOPOLOGY: finear 

( i i ) MOLECULE TYPE; DMA (genomic) 

(iii > HYPOTHETICAL: NO 

( i » ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NOS: 
TOAATTCTAA ACTAOTCGCC AAOOAOACAG TCATAATOAA ATACCTATTO CCTACOOCAG 60 
CCGCTGGATT GTTATTACTC GCTOCCCAAC CAOCCATGGC COAOCTCOTC AOTTCTAGAG 12 0 

TTAAGCOGCC O 



1 3 t 



( 2 ) INFORMATION FOR SEQ ID NO:* 

( i ) SEQUENCE CHARACTERISTICS: 
( A > LENGTH: 139 b»e pairs 
( B ) TYPE: ouckk acid 
( C ) STRANDEDNESS: double 
( D ) TOPOLOOY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

(tit) HYPOTHETICAL: NO 

( i v ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:* 
TCOACOOCCO CTTAACTCTA OAACTOACGA OCTCOOCCAT GGCTGGTTGO OCAOCOAOTA 60 
ATAACAATCC AGCGGCTOCC GTAOOCAATA OGTATTT CAT TATOACTOTC TCCTTOOCGA 120 
CTAGTTTAGA ATTCAAOCT 139 

( 2 ) INFORMATION FOR SEQ ID N03: 

( i ) SEQUENCE CHARACTERISTICS: 
< A ) LENGTH: 21 amino aads 
( B ) TYPE: ammo acid 
( D ) TOPOLOOY: oaknown 

( i i ) MOLECULE TYPE: peptide 

< i i i ) HYPOTHETICAL: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO--5: 

Met Ly » Tyr Leu Leo Pro Thr Alt Ala All Oly Leo Len Leu Leu Ala 
1 5 t0 13 

Ala Ola Pro Ala Mel 

2 0 

{ 2 ) INFORMATION FOR SEQ ID NO*: v 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 23 «ni»o acids 
( B ) TYPE: amino acid 
( D ) TOPOLOOY: unknown 

( i i ) MOLECULE TYPE: peptide 

( i i i ) HYPOTHETICAL: NO 

( ▼ i ) ORIGINAL SOURCE: 

( A ) ORGANISM: Erwuua carowrcra 

( x i > SEQUENCE DESCRIPTION: SEQ ID NOA 

Met Ly i Tyr Leu L«° Pro Tbr Ata Ala Ala Oly Leu LeuLeu Leu Ala 
i 5 10 > ' 
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-continued 



Ala Ola Pro Ala Gin Pro Ala Me I Ala 

20 . 15 

( 2 ) INFORMATION FOR SEQ ID NO:7: 

( i ) SEQUENCE CHARACTERISTICS: / 
( A ) LENGTH: 22 amino acids. 
( B ) TYPE: amino acid 
( D ) TOPOLOOY: unknown 

(it) MOLECULE TYPE: peptide 

( i i i ) HYPOTHETICAL: NO 

( v i ) ORIGINAL SOURCE: 

( A ) ORGANISM: ErWia carotwror* 

(x i ) SEQUENCE DESCRIPTION: SEQ ID NO:7: 

Mel Lyi Sex Leo lie Thr Pro lie Ala Ala Oly Leo Leo Lea Ala Pie 
, 5 10 13 

Sei Ola T y r Ser Leu Ala 

2 0 

( 2 ) INFORMATION FOR SEQ ID NO*: 

( i ) SEQUENCE CHARACTERISTICS: 
( A >LENOTH: 2S aariao ackb 
( B )TYPE: amino acid 
( D ) TOPOLOGY: unknown 

( i i ) MOLECULE TYPE: peptide 

( i i i ) HYPOTHETICAL: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:8: 

Met Ly* lie Lyi Tbr Oly Ala Arglle Leu AlaLeu Sot Ala Leu Thr 

1 5 . 1 0 13 

Tbr Met Mel Pbc Scr Ala Sei Ala Leu Ala Lyi lie 

2 0 2 5 

( 2 ) INFORMATION FOR SEQ ID NOA 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 24 amino acids 
( B ) TYPE: annno acid 
( D )TOPOLOOY: unknown 

< i i ) MOLECULE TYPE: peptide 

( i i i ) HYPOTHETICAL: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NOA 

Me t Met Lys At| Aia lie Leu Ala V.I lie Val Pro Ala Lea Leu Val 
\ 5 10 15 

Ala Oly Thr Ala A » n Ala Ala Ola 

2 0 

( 2 ) INFORMATION FOR SEQ H> NO:10: 

{ i ) SEQUENCE CHARACTERISTICS: 

( A ) LENGTH: 23 ammo acids 
( B ) TYPE: amino acid 
( D ) TOPOLOGY: unknown 

( t i ) MOLECULE TYPE: peptide 

< i i i ) HYPOTHETICAL: NO 



( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
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Met Lyi Oh Ser Tbr I 1-e All Lou Ala Leo Leu Pro Lett Leu Pbe Tbr 

Pio Val Thi Ly* All Aig Tbr 

2 0 

( 2 ) INFORMATION FOR SEQ ID NO:ll: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 23 Mjnao acids 
( B ) TYPE; annuo acid 
•(D) TOPOLOGY: unknown 

( i i ) MOLECULE TYPE: peptide 

( i i i ) HYPOTHETICAL: NO 

( » i ) SEQUENCE DESCRIPTION: SEQ ID NOM: 

Met Ser lie Oln Hi* Pbe Arg Val Ala Leo lit Pro Pb« Pbe Ala Ala 
! 5 »0 15 

Pbe Cy* Leu Pro Val Pbe Ala Hit Pro 

•2 0 2 5 

( 2 ) INFORMATION FOR SEQ ID NO: 12: 

( i ) SEQUENCE CHARACTERISTICS: 
.-( A > LENGTH: 27 amino acid* 
( B ) TYPE: annuo acid 
( D ) TOPOLOOY: unknot 

( i i ) MOLECULE TYPE: peptide 

(ill ) HYPOTHETICAL: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:12: 

Met Met lie Tbt Leu Arg L y » Leu Pto Leu Ala Val Ala Val Ala Ala 

1 5 10 15 



Oly Val Met Ser Ala Oln Ala Met Ala Val Aip 

20 25 



( 2 ) INFORMATION FOR SEQ ID NO: 13: 



( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 22 «nno «cidj 
( B ) TYPE: amino acid 
( D ) TOPOLOOY: i 



< t i ) MOLECULE TYPE: peptide 
(itt ) HYPOTHETICAL: NO 
( x i > SEQUENCE DESCRIPTION: SEQ ID NO: 13: 



Met Lp Ala Tbr Ly. Leu Val Leu Oly Ala Val lie Leu Oly Ser Tbr 
1 s 10 15 



Leu Leu Ala Oly Cy* Ser 
2 0 



< 2 ) INFORMATION FOR SEQ ID NO: 14: 

( i ) SEQUENCE CHARACTERISTICS: 
< A ) LENGTH: 23 annuo acid 
( B ) TYPE: amino acid 
( D ) TOPOLOOY: « 



( i i ) MOLECULE TYPE: peptide 
( I i i ) HYPOTHETICAL: NO 



( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
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Met Lys Ly i Ser Leu Val Leu Lys Ala Ser Val All Val Ala Thr Leo 
I 3 10 15 

Val Pro Mec Leu Ser Phe Ala 
2 0 

( 2 > INFORMATION FOR SEQ ID NO: 15: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 18 amino acids 
( B ) TYPE: amino acid 
( D ) TOPOLOGY: unknown 

( i i ) MOLECULE TYPE: pepbde 

( j i i ) HYPOTHETICAL: NO 

( * i ) SEQUENCE DESCRIPTION: SEQ ID NO-.15: 

Met Lys Lyi Leu Leo Phe Ala lie Pro Leu Val Val Pro Pie Tyr Ser 
1 5 10 13 

H i i Ser 

( 2 ) INFORMATION POR SEQ ID NOU6: 

< i ) SEQUENCE CHARACTERISTICS: 

( A ) LENGTH: 211 amino acids 

{ B ) TYPE: amino acid 

( D ) TOPOLOGY: unknown 

( i i ) MOLECULE TYPE: protein 

( i i i ) HYPOTHETICAL: NO 

( t ) FRAOMENT TYPE: utcrsal 

( t i ) SEQUENCE DESCRIPTION: SEQ ID. NO-.16: 

Pro Phe Val Cy» O I « Tyr Oln Oly Gin Oly Oln Ser Ser Aip Leu Pro 
! 5 ' 10 13 

Oln Pro Pro Val Asn Ala Oly Oly Gly Ser Oly Gly Gly Ser Oly Oly 

2 0 2 3 3 0 

Gly Sei Olo Gly Oly Oly Ser OU Gly Gly GlySer Gin Oly Oly Gly 
35 40 -45 

Ser Ola Oly Oly Oly Ser Oly Oly Gly Ser Oly Ser Oly Asp Phe Aip 
5 0 5 5 « 0 

Tyr Olo Lys Mel Ala A i o Ala Asa Lys Oly Ala Met Thr Olu A»& Ala 

65 70 75 80 

AipOlo A * n Ala Leu Olo S • r Asp Ala Lys Gly Lys Leu Asp Ser Val 
8 5 9 0 9 5 

Ala Thr Asp Tyr Oly Ala Ala lie Asp Oly Phe lie Oly Asp Val Ser 
10 0 10 5 110 

Oly Leo Ala Asn Oly Asn Gly Ala Th x Gly Aip Phe Ala Oly Ser Asn 
t 1 3 12 0 12 5 

Ser Oln Met Ala Gin Val Gly Asp Gly Asp Ain Ser Pro Leo Met Asn 
13 0 13 5 14 0 

Asa-Pbe A r g Gin Tyr Leo Pro Ser Leo Pro 01a Ser Val Olo Cys Arg 
145 130 155 160 

Pro Phe Val Phe Ser Ala Gly Lys Pro Tyr Gla Phe Ser lie Asp Cys 
16 5 17 0 17 5 

Asp Lys lie Asn Leo Phe Arg Oly ViJ Phe Ala Phe Leo Leo Tyr Val 
18 0 18 5 19 0 

Ala Tbr Phe Met Tyr Va] P b c Ser Thr Phe Ala Asn lie Leo Arg Asn 
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L J % O 1 o S e i 
2 1 0 



( 2 ) INFORMATION FOR SEQ ID NO 17: 



( i ) SEQUENCE CHARACTERISTICS: 
< A ) LENGTH: 50 aanao acids 
{ B ) TYPE: MtdM acid 
( D ) TOPOLOGY: unkncwn 



( i i ) MOLECULE TYPE: f 
(iit ) HYPOTHETICAL: NO 
( x i ) SEQUENCE DESCRIPTION: SEQ CD NO: 17: 

Ala Olu OTy A.p Asp Pro All Lyi Ala Ala Pic A * n Ser Leu Ola Ala 

I 5 10 l.J 

Ser Ala Thi Olu Tyi lie Otj Tyi Ala Trp Al* MotVal V.l V,| lie 

2 0 2 5 30 

Val Gly Ala Tbr lie Gly lie Lys Leo Pbe Ly* Lyi Phe Tlr Ser Lyi 
35 40 43 

Ala S a r 

5 0 

( 2 ) INFORMATION FOR SEQ ID NO- 18: 

( t ) SEQUENCE CHARACTERISTICS; 

( A > LENGTH: 30 base pain 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: double 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: RNA (scaomic) 

( i i i ) HYPOTHETICAL; NO 

(if) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:lB: 
AAUCUUGOAO OCUUUUUUAU OOUUCOUUCU - ^ 

( 2 ) INFORMATION FOR SEQ ID NO: 19: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 30 base pair* 
( B ) TYPE: aockie acid 
( C ) STRANDEDNESS: double 
( D ) TOPOLOGY: Hacar 

. { i i > MOLECULE TYPE: RNA (Rename) 

~ < i i i > rTYPOTHETICAL: NO 

( .1 » > ANTtSENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
UAACUAAOOA UOAAAUGCAU GUCU.AAGACA 

( 2 ) INFORMATION FOR SEQ ID NO :20: 

( i ) SEQUENCE CHARACTERISTICS: 

( A ) LENGTH: 30 ba»e paws ( 

( B ) TYPE: noclric acid 

( C ) STRANDEDNESS: doable 

< O ) TOPOLOGY: nnear 

( i i ) MOLECULE TYPE: RNA (gemomic} 



( i i i ) HYPOTHETICAL: NO 
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( i » ) ANTI-SENSE; NO 

< x i ) SEQUENCE DESCRIPTION: SEQ ID NOl20: 
UCCUAGGAGO UUUOACCUAU OCCAOCUUUU 



3 0 



( 2 ) INFORMATION FOR SEQ ID NO:21: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 30 base pain 
( B ) TYPE: axkic acid 
( C ) STRANDEDNESS: doable 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: RNA (genomic) 

{ t i i ) HYPOTHETICAL: NO 

< i y ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID N021 : 
AUOUACUAAG OAGGUUOUAU OGAACAACOC 



3 0 



( 2 ) INFORMATION FOR SEQ ID NO:22: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 32 *ase paw 
~ ( B ) TYPE: nucleic acid 

( C ) STRANDEDNESS: single 
( D > TOPOLOGY: naear 

{ i i ) MOLECULE TYPE: DNA (gcaoxmc) 

( i i i ) HYPOTHETICAL: NO 

{ i v ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID N022: 
OGCCGCAAAT TCTATTTCAA GGAOACAOTC AT 



( 2 ) INFORMATION FOR SEQ ID NO-J3: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 36 hoc pain 
( B ) TYPE: aockic acid 
( C ) STRANDEDNESS: nigle 
( D ) TOPOLOOY: Hocar 

{ i i ) MOLECULE TYPE: DNA (gcooobc) 

( i i i ) HYPOTHETICAL: NO 

( i v ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NCK23: 
AATGAAATAC CTATTGCCTA COGCAOCCOC TGOATT 



( 2 ) INFORMATION FOR SEQ ID N0^4: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 32 baw pain 
f B ) TYPE: aockic acid 
{ C ) STRANDEDNESS: sagfe 
{ D ) TOPOLOGY: fiaear 

( i i ) MOLECULE TYPE: DNA {genomic) 

( V i i ) HYPOTHETICAL: NO 

( i * ) ANTI-SENSE: NO 



( i i ) SEQUENCE DESCRIPTION: SEQ ID NCk24: 
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OTTATTACTC GCTOCCCAAC CAGCCATOGC CC 



( 2 ) INFORMATION FOR SEQ ID NOn25: 

( i > SEQUENCE CHARACTERISTICS: 
< A ) LENGTH: 29 base pain 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: imgle 
( D ) TOPOLOGY: Use* 

( i i ) MOLECULE TYPE: DNA (gcaotnk) 

( i i i ) HYPOTHETICAL: NO 

(it) ANTI-SENSE: NO 

( x L ) SEQUENCE DESCRIPTION: SEQ ID N025: 
CAOTTTCACC TOOOCCATGO CTOOTTOOG 



2 ) INFORMATION FOR SEQ ID NCrJtf: 

) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 40 base pain 
( B ) TYPE: soddc acid 
( C ) STRANDEDNESS: tingle 
( D ) TOPOLOGY: Bncar 

) MOLECULE TYPE: DNA (gcoconc) 

) HYPOTHETICAL: NO 

) ANTI-SENSE: NO 

) SEQUENCE DESCRIPTION: SEQ ID NO:26: 
CAOCOAOTAA TAACAATCCA CCOGCTOCCO TAOOCAATAO 



(.2 ) INFORMATION FOR SEQ ID NO:27: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 38 base pain 
( B ) TYPE: aoclek acid 
( C XSTRANDEDNESS: sto«k 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i 1 i ) HYPOTHETICAL: NO 

< i t ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NOZ7: 
OTATTTCATT ATGACTOTCT CCTTGAAATA OAATTTGC 



( 2 ) INFORMATION FOR SEQ ID N028: 

( i ) SEQUENCE CHARACTERISTICS: 

< A ) LENGTH: 40 base pass 

< B ) TYPE : aickic acid 

( C ) STRANDEDNESS: ringjc 
( D )TOPOLOOY:Enear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

( it ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID N028: 
AOOTOAAACT OCTCOAOATT TCTAOACTAO TTACCCOTAC 



{ 2 ) INFORMATION FOR SEQ ID N029: 



85 



5.759.817 



-continued 



86 



( i ) SEQUENCE CHARACTERISTICS: 
( A > LENGTH: 38 base pain 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: riagk 
( D ) TOPOLOGY: Hflear 

( i i ) MOLECULE TYPE: DNA (genoeme) 

(tit > HYPOTHETICAL: NO 

< i * ) ANTI-SENSE: NO 

(xt) SEQUENCE DESCRIPTION: SEQ ID NCK29: 
CGOAACCTCG TACOOOTAAC TAOTCTAOAA ATCTCOAO 

( 2 ) INFORMATION FOR SEQ ID NO JO: 

( i > SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 33 haw pain 
( B ) TYPE: noekic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: Knear 

( i j ) MOLECULE TYPE: DNA (genomic) 

(iii ) HYPOTHETICAL: NO 

( i y > ANTI-SENSE: NO 

< x i ) SEQUENCE DESCRIPTION: SEQ ID NO JO: 
OACOTTCCOO ACTACGOTTC TTAATAGAAT TCO 

( 2 ) INFORMATION FOR SEQ ID NOJl: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 2S base pm 
{ B ) TYPE: noekic acid 
( C ) STRANDEDNESS: uogk 
( D > TOPOLOOY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i t i ) HYPOTHETICAL: NO 

( i v ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ED NOJl: 
TCOACOAATT CTATTAAOAA CCOTAGTC 

( 2 ) INFORMATION FOR SEQ ID NOJl: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 34 base pain 
( B ) TYPE: auclek acid 
( C ) STRANDEDNESS: unglc 
( D ) TOPOLOGY: Hour 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

( i » ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID N032: 
TOAATTCTAA ACTAOTCOCC AAOOAOACAO TCAT 



< 2 ) INFORMATION FOR SEQ ID NO-J3: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 36 base pain 
( B ) TYPE: nockk jcid 
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( C ) STRANDEDNESS: angle 
( D ) TOPOtOOY: linear 

( i i ) MOLECULE TYPE: DNA (genomic). 

( i i i ) 'HYPOTHETICAL: NO 

( i v > ANTI-SENSE: NO 

(i i ) SEQUENCE DESCRIPTION: SEQ ID NOCJ3: 
AATOAAATAC CTATTGCCTA COOCAOCCOC TOO AT T 



( 2 ) INFORMATION FOR. SEQ ID r40"34: 

( i ) SEQUENCE CHARACTERISTICS: . 
( A ) LENOTH: 31 base pain 
( B ) TYPE: nockic acid 
( C ) STRANDEDNESS: lingJe 
( D ) TOPOLOOY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

( i v ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO*34: 
GTTATTACTC GCTOCCCAAC CAOCCATGGC C 



( 2 ) INFORMATION FOR SEQ ID NOJ3: 

( i ) SEQUENCE CHARACTERISTICS: 
{ A ) LENGTH: 30 base pairs 
( B ) TYPE: nsckic acid 
( C ) STRANDEDNESS: single 
( D >TOPOLOOY: fine* 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL; NO 

( t v ) ANTI-SENSE: NO 

( ii ) SEQUENCE DESCRIPTION: SEQ ID N035: 
OAGCTCOTCA OTTCTAOAGT TAAOCOOCCO 



( 2 ) INFORMATION FOR SEQ ID N006: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 48 ba*e pairs 

( B ) TYPE: cock* acid 

( C ) STRANDEDNESS: single 

( D ) TOPOLOOY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

( i v ) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO-J6: 
OTATTTCATT ATOACTGTCT CCTTGGCGAC TAOTTTAOAA TTCAAGCT 



( 2 > INFORMATION FOR SEQ ID NO-37: 

( i ) SEQUENCE CHARACTERISTICS: 
< A ) LENGTH: 40 ba*c pain 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: ungk 
( D ) TOPOLOOY: fine* 



( i i ) MOLECULE TYPE: DNA (genomic) 
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( i i i ) HYPOTHETICAL: NO 
( i v ) ANTI-SENSE: NO 

< x i ) SEQUENCE DESCRIPTION: SEQ ID NOJ7: 

CAOCOAOTAA taacaatcca ocgoctgccg taggcaata 



( 2 ) INFORMATION FOR SEQ ID NCh3B: 

( i ) SEQUENCE CHARACTERISTICS: 
( A > LENGTH: 27 base pan 
i B >TYPE: nucleic add 
( C ) STRANDEDNESS: ringle 
( D ) TOPOLOGY: baur 

r i i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

{ i v ) ANTI-SENSE: NO 

( 1 i ) SEQUENCE DESCRIPTION: SEQ ID NO-38: 
TOACOAOCTC GOCCATOGCT GGTTGGO 



( 2 ) INFORMATION FOR SBQ ID NO-J* 

( L ) SEQUENCE CHARACTERISTICS: 

< A ) LENGTH: 24 base pm 

< B ) TYPE: mckic aod 

< C ) STRANDEDNESS: single 

< D ) TOPOLOOY: fine* 

( t i ) MOLECULE TYPE: DNA (genomic) 
( j i k ) HYPOTHETICAL: NO 
( i » ) ANTI-SENSE: NO 

( x i > SEQUENCE DESCRIPTION: SEQ ID N009: ' 
TCOACGOCCG CTTAACTCTA OAAC 



( 2 ) INFORMATION FOR SEQ ID NO:40: 

< j ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 22 haac pais 
( B >TYPE: nucleic mad 
( C > STRANDEDNESS: mgje 
( D >TOPOLOGY: fiae* 

( i i ) MOLECULE TYPE: DNA (geocanc) 

( i i i ) HYPOTHETICAL: NO 

( i * ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:40: 
AGGTSMARCT ICTCOAOTCW OG 



( 2 ) INFORMATION FOR SBQ ID NO:41: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 22 base pain 

< B ) TYPE: wefcic acid 

< C ) STRANDEDNESS: angle 
( D ) TOPOLOGY: haot 

( i i ) MOLECULE TYPE: DNA (genomic) 

(iti ) HYPOTHETICAL: NO 

ANTI-SENSE: NO 
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r x i ) SEQUENCE DESCRIPTION: SEQ ID NO*l: 
AOGTCCAOCT OCTCOAGTCT GO 



( 2 ) INFORMATION FOR SEQ ID NO:42: 

( t ) SEQUENCE CHARACTERISTICS: 
( A ) LENOTH: 22 base pain 
( B ) TYPE: nucleic acid 
< C ) STRANDEDNESS: tingle 
{ D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

< i t i ) HYPOTHETICAL: NO 

( i v ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID HOA2: 
AOOTCCAOCT OCTCGAOTCA OO 



( 2 ) INFORMATION FOR SEQ ID NOt43: 

( i ) SEQUENCE CHARACTERISTICS: 
( A > LENGTH: 22 ba*c pain 
( B >TYPE: nucleic sod 
( C ) STRANDEDNESS: ringfe 
{ D ) TOPOLOGY: li»*ar 

< i i ) MOLBCULE TYPE: DNA {genomic) 

( i i i ) HYPOTHETICAL: NO 

( i t ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:43: 
AOGTCCAOCT TCTCOAOTCT OO 

( 2 ) INFORMATION FOR SEQ ID NO*4: 

( i ) SEQUENCE CHARACTERISTICS: 
< A ) LENGTH: 22 base pain 
( B ) TYPE: noclac acid 
( C ) STRANDEDNESS: tingle 
( D ) TOPOLOGY: fine* 

( i i ) MOLECULE TYPE: DNA (genomic) 

(iii ) HYPOTHETICAL: NO 

( i ▼ ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 
AOOTCCAOCT TCTCOAOTCA OO. 



( 2 ) INFORMATION FOR SEQ ID NO-.4S: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 22 base prin 
( B ) TYPE: oockic acid 
( C ) STRANDEDNESS: single 
( D )TOPOLOOY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

( i v ) ANTI-SENSE: NO 

(si) SEQUENCE DESCRIPTION: SEQ ID WMSi 



AOOTCCAACT OCTCOAGTCT OO 
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( 2 ) INFORMATION FOR SEQ ID NCM6: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 22 base pain 
( B ) TYPE: uuclbc acid 
( C > STRANDEDNESS: ringk 
( D ) TOPOLOOY: Hoeaf 

( 1 i ) MOLECULE TYPE: DNA (genomic) 

( i i > ) HYPOTHETICAL: NO 

( it ) ANTI-SENSE: NO 

f x i ) SEQUENCE DESCRIPTION: SEQ ID NO-.46: 
AOGTCCAACT OCTCOAOTCA OG 

( 2 ) INFORMATION FOR SEQ ID NO:47: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 22 base pain 
( B ) TYPE: oDcldc aod 
( C ) STRANDEDNESS: ungfe 
( D ) TOPOLOOY: Encar 

( i i ) MOLECULE TYPE; DNA (genomic) 

< i i i ) HYPOTHETICAL: NO 

( i v ) ANTI-SENSE: NO 

( i i ) SEQUENCE DESCRIPTION: SEQ ID NO:47: 
AOOTCCAACT TCTCOAOTCT OO 

( 2 ) INFORMATION FOR SEQ ID NO:4»: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 22 base pain 
( B ) TYPE: nx\ac arid 
( C ) STRANDEDNESS: fingic 
( D ) TOPOLOOY: tines 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

( i t ) ANTI-SENSE: NO 

( i i ) SEQUENCE DESCRIPTION: SEQ ID NOr4»: 
AOGTCCAACT TCTCGAOTCA GG 

( 2 ) INFORMATION FOR SEQ ID NOt«9: 

{ i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 22 base pain 
( B ) TYPE: cuclek add 
{ C ) STRANDEDNESS: tingle 
( D ) TOPOLOGY: bocx 

< i i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

( i r ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:«9: 
AOOTNNANCT NCTCOAOTCW OO 



( 2 ) INFORMATION FOR SEQ ID NOJO: 
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( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 19 base pain 
{ B ) TYPE : nucleic acid 
( C ) STRANDEDNESS: tingle 
( D ) TOPOLOOY: tine* 

( i i ) MOLECULE TYPE: DNA (genomic) 

{ i i i ) HYPOTHETICAL: NO 

( t v ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NOJO: 
GCCCAAGOAT GTOCTCACC 



( 2 ) INFORMATION FOR SEQ ID NCn51: 

( i ) SEQUENCE CHARACTERISTICS: 

< A ) LENGTH: 39 base pain 

< B ) TYPE: nucleic acid 

( C ) STRANDEDNESS: single 
( D JTOPOLOOY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

( i t ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NOt5l: 
CTATTAGAAT TCAACOOTAA CAOTGGTOCC TTGGCCCC A 



( 2 ) INFORMATION FOR SEQ ID NCh32: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 38 base pain 
(B > TYPE: nucleic arid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: finear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

( i » ) ANTI-SENSE: NO 

{ x i > SEQUENCE DESCRIPTION: SEQ ID NOJ2: 
CTATTAACTA OTAACOOTAA CAOTQOTOCC TTOCCCCA 



( 2 ) INFORMATION FOR SEQ ID NOJ3: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 19 base pans 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: angle 
( D ) TOPOLOGY: Kacar 

( i i > MOLECULE TYPE: DNA {genomic) 

( t I i ) HYPOTHETICAL: NO 

( i t ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NOS3: 
CTCAGTATGO TOOTTGTGC 



( 2 ) INFORMATION FOR SEQ ID NOJ4: 

( i > SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 26 base pea* 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: tingle 
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( D >TOPOLOOY: line* 
( i i ) MOLECULE TYPE: DNA (genomic) 
( i i i ) HYPOTHETICAL: NO 
( i » ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO-54: 
OCT ACT ACT T TTGATTTCCA CCTTGO 



( 2 ) INFORMATION FOR SEQ ID N053: 

< i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 23 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOOY: Knot 

( i i ) MOLECULE TYPE: DNA (genomic) 

( t i i ) HYPOTHETICAL: NO 

( i r ) ANTI-SENSE: NO 

< x i ) SEQUENCE DESCRIPTION: SEQ ID NOS5: 
CAOCCATOGC CGACATCCAO ATO 



{ 2 ) INFORMATION FOR SEQ ID NO*J6: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 30 base pain 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOOY: linear 

(t i ) MOLECULE TYPE: DNA (genomic) 

< i i i ) HYPOTHETICAL: NO 

( i v ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NOt56: 
AATTTTACTA OTCACCTTOO TOCTOCTOOC 



( 2 ) INFORMATION FOR SEQ ID NOJ7: 

( i ) SEQUENCE CHARACTERISTICS: 
{ A ) LENGTH: 39 base pain 
( B ) TYPE: noelek acid 
{ C ) STRANDEDNESS: ungk 
( D ) TOPOLOOY: linear 

{ i i ) MOLECULE TYPE: DNA (genomic) 

( a i i ) HYPOTHETICAL: NO 

( i * ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NOJ7: 
TATGCAACTA OTACAACCAC AATCCCTOGG C AC A AT T TT 



( 2 ) INFORMATION FOR SEQ ID N038: 

( i > SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 30 base p«n 
{ B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: sngk 
< D ) TOPOLOGY: linear 



2 6 



( i i ) MOLECULE TYPE: DNA (genomic) 
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{ i i i ) HYPOTHETICAL: NO 
( i v ) ANTI-SENSE: NO 

( * i ) SEQUENCE DESCRIPTION: SEQ CD NO--58: 
AOOCTTACTA OTACAATCCC TOOOCACAAT 



( 2 ) INFORMATION FOR SEQ ID NO.59: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 32 base pain . 
f B ) TYPE: nucleic acid 
(. C ) STRANDEDNESS: Mn«k 
< D ) TOPOLOGY: line* 

( i i ) MOLECULE TYPE: DNA (genomic) 

(iit ) HYPOTHETICAL: NO 

( i v ) ANTI-SENSE: NO 

( x t ) SEQUENCE DESCRIPTION: SEQ ID N039: 
CCAOTTCCOA GCTCOTTGTO ACTCAOGAAT CT 



( 2 ) INFORMATION FOR SEQ ID NO«h 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 32 base pmn 
( B ) TYPE: ooclck acid 
( C ) STRANDEDNESS: lingle 
( D )TOPOLOOY:fincar 

( i i > MOLECULE TYPE: DNA (genomic) 

( t i i ) HYPOTHETICAL: NO 

(it) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NOrfO: 
CCAOTTCCOA OCTCOTOTTO ACOCAGCCOC CC 



( 2 ) INFORMATION FOR SEQ ID NOrft: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 32 base jmn 
( B )TYPE: node* acid 
( C ) STRANDEDNESS: smgk 
( D ) TOPOLOGY: linear 

( t i > MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

( i r ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NOr6l: 
CCAOTTCCOA OCTCOTGCTC ACCCAOTCTC CA 



< 2 ) INFORMATION FOR SEQ ID N0^2: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 32 base pain 
( B ) TYPE: aockic acid 
( C ) STRANDEDNESS: lmgk 
( D ) TOPOLOGY: tinear 

( i i ) MOLECULE TYPE: DNA (genomic) 

(iii ) HYPOTHETICAL: NO 



( i t ) ANTI-SENSE: NO 
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( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:62: 
CCAOTTCCOA OCTCCAOATO ACCCAOTCTC CA 

( 2 ) INFORMATION FOR SEQ ID HCh63: 

( i ) SEQUENCE CHARACTERISTICS: 
< A ) LENGTH: 32 base pain 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: fine* 

( i i ) MOLECULE TYPE: DNA (genomic) 

< i i i ) HYPOTHETICAL: NO 

( i r ) ANTI-SENSE: NO 

( x i ) SBJUENCE DESCRIPTION: SEQ H> NO£3: 
CCAGATGTOA GCTCOTOATG ACCCAOACTC CA 



( 2 ) INFORMATION FOR SEQ ID NO:64: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) l^NOTH: 32 base pun 
( B ) TYPE: mideic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: finear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

(it) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO*4: 
CCAOATOTOA OCTCOTCATG ACCCAOTCTC CA 

( 2 ) INPORMATION FOR SEQ ID NOr65: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 32 base pain 
< B ) TYPE: aockic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i > MOLECULE TYPE: DNA (genome) 

( 1 i i ) HYPOTHETICAL: NO 

( i v ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ff> NO*5: 
CCAOATOTOA OCTCTTQATO ACCCAAACTC AA 

< 2 ) REFORMATION FOR SEQ ID NOrf6: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 32 base pan 
( B ) TYPE: nucleic add 
( C ) STRANDEDNESS: single 
( D )TOPOLOOY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

{ j i i ) HYPOTHETICAL: NO 

{ i y ) ANTI-SENSE: NO 

' ( x i ) SEQUENCE DESCRIPTION: SEQ ID NOrf6: 

CCAGATGTOA OCTCOTGATA ACCCAOOATO AA 
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( 2 ) INFORMATION FOR SEQ ID NCh67: 

{ i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 32 base pair* 

< B ) TYPE: nucleic acid 

( C > STRANDEDNESS: «mgk 

< D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 
. ^ fiii ) HYPOTHETICAL: NO 
( i v ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NOKS7: 
OCAOCATTCT AGAGTTTCAO CTCCAOCTTO CC 



( 2 ) INFORMATION FOR SEQ ID NCh68: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 33 base jmn 
( B ) TYPE: nocleic acid 
( C ) STRANDEDNESS: iiagk 
( D ) TOPOLOOY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

< i v ) ANTI-SENSE: NO 

< x i ) SEQUENCE DESCRIPTION: SEQ ID NOt6»: 
CCOCCOTCTA OAACACTCAT TCCTGTTOAA OCT 



( 2 ) INFORMATION FOR SEQ ID NO*»: 

< i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 33 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: ringk 
( D ) TOPOLOGY: linear 

( i i > MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

( i v ) ANTI-SENSE: NO 

(x i) SEQUENCE DESCRIPTION: SEQ ID NO** 
CCOCCOTCTA O A AC AT TC TO CAGGAGACAO ACT 



( 2 ) INFORMATION FOR SEQ ID NO: TO: 

( i ) SEQUENCE CHARACTERISTICS: 
( A j LENGTH: 32 base pais 
( B ) TYPE: nu cl eic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOOY: bur 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

(it) ANTI-SENSE: NO 

(xx) SEQUENCE DESCRIPTION: SEQ ID NOVK): 
CCAOTTCCGA OCTCOTOATO ACACAOTCTC CA 



( 2 ) INFORMATION FOR SEQ ID NO-.71: 

( i ) SEQUENCE CHARACTERISTICS: 
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( A ) LENGTH: 34 ba*o pain 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
< D )TOPC>LCOY: linear 

( i i > MOLECULE TYPE: DNA (genocide) 

f j i i ) HYPOTHETICAL: NO 

( i * > ANTI-SENSE: NO 

<*x i ) SEQUENCE DESCRIPTION: SEQ ID NO-.71 : 
GCOCCOTCTA OAATTAACAC TC A TTCCTCT T G A A 



( 2 ) INFORMATION FOR SEQ ID NO: 72: 

( i ) SEQUENCE CHARACTERISTICS: 
f A ) LENGTH: 3» base pairs 
( B ) TYPE: xucksc acid 
( C ) STRANDEDNESS: tingle 
( D ) TOPOLOGY: fine* 

{ i i ) MOLECULE TYPE: DNA (geooabc) 

( i i i ) HYPOTHETICAL: NO 

( i v } ANTI-SENSE: NO 

( * i ) SEQUENCE DESCRIPTION: SEQ ID NO:72: 
CTATTAACTA OTAACGGTAA CAOTOGTOCC TTGCCCCA 



( 2 ) INFORMATION FOR SEQ ID NO: 73: 

( i ) SEQUENCE CHARACTERISTICS: 
{ A ) LENGTH: 30 base pain 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: «mgle 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

( i y ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 73 : . 
ACOCTT ACTA OTACAATCCC TOOOCACAAT 



( 2 ) INFORMATION FOR SEQ IT> NO:74: 

{ i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 29 base pain 
( B ) TYPE: ODclek acid 
( C ) STRANDEDNESS: single 
( D )TOPOLOOY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

{ i i i ) HYPOTHETICAL: NO 

( i » > ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:74: 
OCCOCTCTAO AACACTCATT CCTOTTGAA 



( 2 ) INFORMATION FOR SEQ ID NO:75: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 22 base pain 
( B ) TYPE: EDcJeic tad 
( C ) STRANDEDNESS: tingfe 
( D ) TOPOLOGY: Kaear 
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( i t ) MOLECULE TYPE: DNA (genomic) 
(iii ) HYPOTHETICAL: NO 
( i r ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:75: 
AOOTNNANCT NCTCGAOTCT GC 



( 2 ) INFORMATION FOR SEQ ID NO:76: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 22 base pam 
( B ) TYPE: ruckic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: fiaear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

< i ▼ ) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:76: 
AOOTNNANCT NCTCOAGTCA GC 



( 2 ) INFORMATION FOR SEQ ID NOT7: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 35 base pun 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: tingle 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

( i y ) ANTI-SENSE: NO 

( I i ) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 
OTOCCAGATO TGAOCTCOTO ATOACCCAGT CTCCA 



( 2 ) INFORMATION FOR SEQ ID NO:78: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 34 base pain 
( B ) TYPE: nocleic acid 
( C ) STRANDEDNESS: siagk 
{ D ) TOPOLOGY: linear 

( i t ) MOLECULE TYPE: DNA (genomic) 

(iit ) HYPOTHETICAL: NO 

( i * ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:7»: 
TCCTTCTAOA TTACTAACAC TCTCCCCTOT T O A A 



( 1 ) INFORMATION FOR SEQ ID NaT* 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 34 base pain 
( B ) TYPE: oDckic acid 
( C ) STRANDEDNESS: single 
( D )TOPOLOOY: Enesr 

< i i ) MOLECULE TYPE: DNA (genocne) 



3 5 



3 4 



( i i i ) HYPOTHETICAL: NO 
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f i v ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQID NO: 79: 
OCATTCTAOA CTATTATOAA CATTCTOT AO OOOC 

( 2 ) INFORMATION FOR SEQ ID NO:80: 

( j ) SEQUENCE CHARACTERISTICS: 
{ A ) LENGTH: 37 base pm 
{ B ) TYPE: mclac acid 
( C ) STRANDEDNESS: single 
{ D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA {gcaxmc) 

( i i i ) HYPOTHETICAL: NO 

( i * ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO-.80: 
CTOCACAGOG TCCTOGGCCO AOCTCOTOOT GACTCAO 

( 2 ) INFORMATION FOR SEQ ID NO-SI : 

( i ) SEQUENCE CHARACTERISTICS: t 
( A ) LENGTH: 22 b**e pans 
( B ) TYPE: nockic acid 
( C ) STRANDEDNESS: ragle 
(D)TOPOLOOY:Snear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

( i t ) ANTI-SENSE: NO 

(li) SEQUENCE DESCRIPTION: SEQ ID NO--81:. 
AONTOCANNT OCTCOAGTCT GO 

{ 2 ) INFORMATION FOR SEQ ID NO:82: 

( i ) SEQUENCE CHARACTERISTICS: 
. ( A ) LENGTH: 43 ba»e pain 
( B )TYPE: oockac add 
{ C > STRANDEDNESS: angle 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (gcacmic) 

(iii) HYPOTHETICAL: NO 

(it) ANTVSENSE: NO 

( i i ) SEQUENCE DESCRIPTION: SEQ CD NO:82: 
OTOOOCATOT OTOAOTTOTO TCACTAOTTO OOGTTTTGAO CTC 

( 2 ) 1NPORMATTON FOR SEQ ID NO: 83: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 27 base pain 
( B ) TYPE: auckac add 
( C ) STRANDEDNESS: nogfe 
( D ) TOPOLOGY: fiaear 

( i i ) MOLECULE TYPE: DNA (geaenmc) 

(iii ) HYPOTHETICAL: NO 

f i v ) ANTI-SENSE: NO 



( x i ) SEQUENCE DESCRIPTION: SEQ ID NO-.83: 
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AOCATCACTA GTACAAGATT TOGOCTC 

( 2 ) INFORMATION FOR SEQ ID NO&4: 

( i ) SEQUENCE CHARACTERISTICS : 
( A ) LENGTH: 23 base pair* ■ 

< B ) TYPE: ouclck acid 

( C ) STRANDEDNESS: tm&c 
. < D ) TOPOLOGY: finear 

■ ( i i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

( i t ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO-.84: 
AOOTOCAGCT OCTCGAGTCT OO 

( 2 ) INFORMATION FOR SEQ ID NO:85: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 22 base pairs 
( B ) TYPE: nucleic •cid 
( C ) STRANDEDNESS: uagte 

< D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 
< i i i ) HYPOTHETICAL: NO 
( i ▼ > ANTI-SENSE: NO 

( x i > SEQUENCE DESCRIPTION: SEQ ID NO:85: 
AOOTOCAGCT OCTCOAGTCO OO 

( 2 ) INFORMATION FOR SEQ ID NO-.86: 

< i ) SEQUENCE CHARACTERISTICS: 
{ A ) LENGTH: 22 base pairs 
( B ) TYPE: DDclck acid 
( C ) STRANDEDNESS: tmgk 
( D ) TOPOLOGY: Hnear . 

( i i ) MOLECULE TYPE; DNA (genomic) . 

( i i i ) HYPOTHETICAL: NO 

( i v ) ANTkSENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NCfc96: 
AGOTOCAACT GCTCGAOTCT OO 

( 2 ) INFORMATION FOR SEQ ID NO:a7: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 22 base pain 
( B )TYPE: oxide acid 
( C ) STRANDEDNESS: sagfe 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

( i r ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO-J7: 
AOGTOCAACT OCTCOAGTCO OO 
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( 2 ) INFORMATION FOR SEQ ID NO: 88: 

, ( i ) SEQUENCE CHARACTERISTICS: 

( A > LENGTH: 3* bate pan 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: line* 

( i i ) MOLECULE TYPE; DNA (gcaocrbc) 

( i t i ) HYPOTHETICAL: NO 

(it) ANTI-SENSE: NO 

( x \ ) SEQUENCE DESCRIPTION: SEQ ID NO:88: 
TCCTTCTAOA TTACTAACAC TCTCCCCTGT TGAA 



( 2 ) INFORMATION FOR SEQ ID NO:«9: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 37 base pm 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: fines 

( i i ) MOLECULE TYPE: DNA (genome) 

< i i i ) HYPOTHETICAL: NO 

( t v ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO-J9: 
CTGCACAGGO TCCTOOOCCO AOCTCOTOOT GACTCAG 



( 2 ) INFORMATION FOR SEQ ID NO30: 

f i ) SEQUENCE CHARACTERISTICS: 
. ( A ) LENGTH: 31 base pairs 
( B ) TYPE: nucleic nod 
( C ) STRANDEDNESS: ringte 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

( i v ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NOSO: 
GCATTCTAOA CTATTAACAT TCTGTAGOOO C 



. < 2 ) INFORMATION FOR SEQ ID NOt9I : 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 19 base psir* 
( B ) TYPE: noetic acid 
( C ) STRANDEDNESS: Mfigte 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (genocne) 

( i i i ) HYPOTHETICAL: NO 

( i v ) ANTI-SENSE : NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID N031 : 
ACCCAAGGAC ACCCTCATG 



( 2 ) INFORMATION FOR SEQ ID NCW2: 



( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 19 base pain 
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( B ) TYPE: nucleic Acid 

( C ) STRANDEDNESS: single 

( D ) TOPOLOGY: lines 

( i i ) MOLECULE TYPE: DNA (*«»mk) 

( i i i ) HYPOTHETICAL: NO 

< i v ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NCH92: 
CTCAOTATOO TOOTTOTOC 



( 2 ) INFORMATION FOR SEQ ID N093: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 33 ba» pain 
( B )TYPE: nucleic acid 
( C > STRANDEDNESS: na^e 
( D ) TOPOLOGY: linear 

{ i i ) MOLECULE TYPE: DNA (gcrcmic) 

< t i i ) HYPOTHETICAL: NO 

( i t ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID N053: 
OTCTCACTAO TCTCCACCAA OOOCCCATCO OTC 



< 2 ) INFORMATION FOR SEQ ID NOS*: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 39 b»e pain 
( B ) TYPE: nucjdc and 
{ C > STRANDEDNESS: xa&e 
( D )TOPOLOOY:fiocar 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

( i v ) ANTI-SENSE: NO 

( l i ) SEQUENCE DESCRIPTION: SEQ ED NQ3W: 
ATATACTAGT GAOACAOTOA CCAGGGTTCC TTGOCCCC 



( 2 ) INFORMATION FOR SEQ ID t*>95: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 24 base pain 
< B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: fiaear 

( i i ) MOLECULE TYPE: DNA (genomic) . 

< i i i ) HYPOTHETICAL: NO 

< i v ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NOS5: 
ACGTCTAOAT TCCACCTTOG TCCC 



( 2 ) INFORMATION FOR SEQ ID N036: 

< i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 31 base pain 
( B ) TYPE: ouclcic acid 
( C ) STRANDEDNESS: nagJe 
( D ) TOPOCOOY: Baeai 
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< t i ) MOLECULE TYPE: DNA (genomic) 
( i i i ) HYPOTHETICAL: NO 
{ i v > ANTI-SENSE: NO 

( * i ) SEQUENCE DESCRIPTION: SEQ ID NOS6: 
OCATAC7AOT CTATTAACAT TCTOTAOGGG C 

( 2 ) INFORMATION FOR SEQ ID NO:97: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 27 base padn 
( B , ) TYPE: noekk acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i j ) MOLECULE TYPE: DNA (genomic) 

( t i i > HYPOTHETICAL: NO 

( i v ) ANTI-SENSE: NO . ~" 

(ii ) SEQUENCE DESCRIPTION: SEQ ED N037: 

CCOOAATTCT TATCATTTAC CCGOAOA " 2 7 

( 2 ) INFORMATION FOR SEQ ID N0^8: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 29 base pain 
( B ) TYPE: nocleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOO Y: line* 

( i i ) MOLECULE TYPE: DNA (genomic) 

(iii ) HYPOTHETICAL: NO 

( i v ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NOS8: 
TCTOCACTAO TTOOAATOGO CACATOCAG 19 

( 2 ) INFORMATION FOR SEQ ID NC*99: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 79ft bow pairs 
( B ) TYPE: ancleic acid 
( C ) STRANDEDNESS: dnaVk 
( D ) TOPOLOGY: tmwr 

(i i ) MOLECULE TYPEi'DNA (gaxxsac) _ 

( i i i ) HYPOTHETICAL; NO 

( i v ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NOSfc 

OOCCOCAAAT TCTATTTCAA OGAGACAOTC ATAATOAAAT ACCTATTOCC TACGOCAGCC 60 

OCTOGATTOT TATTACTCOC TGCCCAACCA OCCATOGCCC AOOTGAAACT GCTCOAOTCA 120 

OOACCTGGCC TCOTOAAACC TTCTCAOTCT CTOTCTCTCA CCTOCTCTOT CACTOACTAC l»0 

TCCATCACCA OTOCTTATTA CTOOAACTGO ATCCOOCAOT TTCCAOOAAA CAAACTOGAA 240 

TGOATOOGCT ACATAAOCTA CGACOOTGTC AATAAOTATO ATCCATCTCT CAAOAATCGA 300 

ATCTCCATCA CTCOTGACAC ATCTAACAAT CAGTTTTTCC AOAAO TTO AT TTCTOTOACT 360 

TCTOAOGACA CAOOAACATA TGACTGTTCA AO A O OO A CT A OOGCCTCTOC TATOOACTAC 420 

TGOOOT CAAG OAATTTCAOT CACCOTCTCC TCAOCCAAAA COACACCCCC aTCTGTCTAT 480 
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CCACTOOCCC 


CTOGATCTGC 


TOCCC AA ACT 


AACTCCATOG 


TGACCC TOGO 


ATOCCTOGTC 


5 4 0 


A AOGGCT ATT 


TCCCTOAOCC 


AO TOACAGTG 


ACCTOG AAC T 


CTGGATCCCT 


O TCCAOCOGT 


6 0 0 


OTOC AC AC CT 


TCCCAGCTOT 


CCTOCAOTCT 


O ACC TCT AC A 


C TCTO AOC AG 


CTCACTOACT 


6 6 0 


GTCCCCTCCA 


OCCCTCGGCC 


C AOCO AGACC 


GTC ACCTOCA 


ACGT TOCCCA 


CCCOOCCAOC 


7 2 0 


AOCACCAAGO 


TOO AC A AG AA 


AATTGTOCCC 


AOGG AT TGT A 


CT ACT T ACCC 


GTACGACG TT 


7 & 0 


CCOO AC T ACG 


GTTCTTAA . 










7 9 8 



( 2 ) INFORMATION FOR SBQ ID NO:10Ot 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 194 base pars 
( B ) TYPE: nucleic jcid 
( C > STRANDEDNESS: doable 
( D ) TOPOLOGY: boat 

( i i I MOLECULE TYPE: DNA (genomic) 

(jit ) HYPOTHETICAL: NO 

( i t ) ANTI-SENSE : NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 
TGAATTCTAA ACTAGTCGCC AAOOAOACAO TCATAATOAA ATACCTATTG CCTACGOCAG 
CCOCTOOATT OTTACTCOCT GCCCAACCAG CCATOGCCOA GCTCCAOATO ACCCAOTCTC 
CAOCCTCCCT ATCTOCATCT OTOOOAOAAA CTGTCACCAT CACATOTCOA TCAAOTOAOA 
ATATTACAAT TACT 

( 2 ) INFORMATION FOR SEQ ID NO: 101: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 333 buc pain 
( B ) TYPE: nucleic acid 
{ C ) STRANDEDNESS: double 
( D )TOPOLOOY:liae* 

( J i ) MOLECULE TYPE: DNA (gcnoaac) 

( i i i ) HYPOTHETICAL: NO 

( i * ) ANTI-SENSE: NO 



( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 



CTOATOCTOC 


ACC AACTOTA 


TCCATCTTCC 


C ACC ATCC AO 


TOAOC AO TTA 


AC ATCTOOAG 


6 0 


OTOCCT CAGT 


COTGTOCTTC 


TTGAAC A ACT 


TCT ACC CCA A 


AOACT ACAAT 


OTCAAOOOOA 


1 2 0 


AGATTOATOO 


C AGTOAACOA 


CAAAATOOCO 


TCCTOAAC AG 


TTGOAC TGAT 


CAOOAC AOCA 


1 S 0 


AAOACAOC AC 


CTAC AOCATO 


AOCAOCACCC 


TCACCTTOAC 


C AAGO ACOAO 


TA TOA ACOAC 


2 4 0 


AT AACAOCT A 


T ACCTGTOAT 


GCCACTCACA 


AGACATC AAC 


TTC ACCC ATT 


OTCAAOAOCT 


3 0 0 


TCAAC AOGA A 


TOAOTOTTAA 


TTCTAO ACOO 


CGC 






3 3 3 



{ 2 ) INFORMATION FOR SEQ ID NO:IOZ- 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 150 base pax* 
( B ) TYPE: ondeic add 
( C ) STRANDEDNESS: double 
{ D ) TOPOLOGY: Saac 

( j i ) MOLECULE TYPE: DNA (geaomk) 



( i i i ) HYPOTHETICAL: NO 
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( i * ) ANTI-SENSE: NO 

(ii) SEQUENCE DESCRIPTION: SEQ ID NO: 102: 
OCTOAOOGTO aCOATCCCGC AAAAGCOQCC 
TATATCCOTT ATGCGTOOOC OATCOTTGTT 
CTOTTTAAGA AATTC ACCTC OAAAGCAAGC 



( 2 ) INFORMATION FOR SEQ ID NO-.103: 

t i ) SEQUENCE CHARACTERISTICS: 
{ A ) LENGTH: 36 ba*c pain 
( B J TYPE: nucleic acid 
< C ) STRANDEDNESS: ibfile 
( D ) TOPOLOGY: finear 

( i i ) MOLECULE TYPE: DNA (genomic) 

(iii ) HYPOTHETICAL: NO 

< i t > ANTI-SENSE: NO 

( i i ) SEQUENCE DESCRIPTION: SEQ ID NO-.103: 
OTOCCCAOOO ATTOTACTAO TOCTOAOOOT GACOAT 



( 2 ) INFORMATION FOR SEQ ID NO: 104: 

< i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 33 base pain 
( B ) TYPE: nucleic add 
( C ) STRANDEDNESS: tagfe 
( D ) TOPOLOGY: ftsev 

( i i ) MOLECULE TYPE DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

( t » ) ANTI-SENSE: NO 

(ii) SEQUENCE DESCRIPTION: SEQ ID NO:I04t 
ACTCOAATTC TATCAOCTTO CTTTCOAOOT OAA 



( 2 ) INFORMATION FOR SEQ ID NO. 105: 

( t ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 22 heac ps» 
( B ) TYPE: nucleic add 
( C ) STRANDEDNESS: single 
( D )TOPOLOOY: linear 

(ii ) MOLECULE TYPE: DNA (genoaac) 

( i i i ) HYPOTHETICAL: NO 

( i v ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 
AGGTCCAGCT TC TCOAGTC T GO 



( 2 ) INFORMATION FOR SEQ ID NO-.I06: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 33 ha*; pain 
( B ) TYPE: oockac acid 
( C ) STRANDEDNESS: tingle 
{ D )TOPOLOOY:Iiaear 

< i i ) MOLECULE TYPE: DNA (scam) 



TTTAACTCCC TOCAAGCCTC AGCOACCOAA 60 
OTCATTOTCO GCGCAACTAT COGTATCAAG 120 

I 3 0 



< i i i > HYPOTHETICAL: NO 
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( i v ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO. 106: 
OTCACCCTCA OCACTAOTAC AATCCCTOOO CAC 



i 2 > INFORMATION FOR SEQ ID NO: 107: 

< i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 48 base pairs 
( B ) TYPE: nuclei* acid 
( C ) STRANDEDNESS: usfde 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

{ i i i ) HYPOTHETICAL: NO 

( i r ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 107: 
OAOACOACTA GTGGTGCCGG TOOCTCTCCA TTCOTTTOTG A AT A T C A A 



( 2 ) INFORMATION FOR SEQ ID NO: 108: 

( i ) SEQUENCE CHARACTERISTICS: 
{ A ) LENGTH: 40 base pairs 
( B ) TYPE: tuckk acid 
( C ) STRANDEDNESS: .ingle 
( D ) TOPOLOGY: hoc* 

( i i ) MOLECULE TYPE: DNA (genomic) 

{ i i i ) HYPOTHETICAL: NO 

( i v ) ANT) -SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 108: 
TTACTAGCTA OCATAATAAC OGAATACCCA AAAGAACTGG 



( 2 ) INFORMATION FOR SEQ ID NO. 109: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 36 base pairs 
( B ) TYPE: oockac acid 
( C ) STRANDEDNESS: angle 
( D >TOPOLOOY: Haear 

( i i ) MOLECULE TYPE: DNA-(ge«xiik) 

( i » i ) HYPOTHETICAL: NO 

1 ( i v ) ANTI-SENSE: NO 

(xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 

TATOCTAOCT AGTAACACGA CAOGTTTCCC OACTOO 



( 2 ) INFORMATION FOR SEQ ID NO:ll(h 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 27 baw pair* 
( B ) TYPE: mckic acid 
( C ) STRANDEDNESS: lingfc 
< D ) TOPOLOGY: Imc* 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

(it) ANTI-SENSE: NO 



( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: HO: 
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ACCOAGCTCO AATTCOTAAT CATOOTC 

{ 2 > INFORMATION FOR SEQ ID NO:lll: 

( ) SEQUENCE CHARACTERISTICS: 
( A > LENGTH: 186 base pair* 



( C ) STRANDEDNESS: i 
( O ) TOPOLOGY: linear 


bt*lc 










( t i ) MOLECULE TYPE: DNA (genomic) 










( i i i ) HYPOTHETICAL: NO 












( i v ) ANTI-SENSE: NO 












( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 111: 










GTOCCCAGGO ATTGTACTAO 


TOC TOAGGGT 


OACOATCCCO 


C A AAAGCOGC 


CTTTAACTCC 


6 0 


CTOCAAGCCT CAOCOACCGA 


ATATATCOGT 


TATGCOTGGG 


CGATOGTTOT 


TO TC A TT GT C 


1 2 0 


GOCOCAACTA TCGOT ATCAA 


OCTGTTTAAO 


AAATTCACCT 


CO AAAGCAAO 


CTGATAOAAT 


1 8 0 


TCGAOT 










I 8 6 


( 2 ) INFORMATION FOR SEQ ID NO: 112: 












( i ) SEQUENCE CHARACTERISTICS: 
< A ) LENGTH: 666 base pain 
( B ) TYPE: mckx add 
( C ) STRANDEDNESS: doable 
( D ) TOPOLOGY: linear 










( i i ) MOLECULE TYPE: DNA (gemnic) 










(lit ) HYPOTHETICAL: NO 












( i v ) ANTI-SENSE: NO 












( z i ) SEQUENCE DESCRIPTION: SEQ ID NOH2: 










CCATTCOVTT OTOAATATCA 


AGOCCA AOOC 


CAAT CGTCTO 


ACCTOC C TCA 


ACCTCCTOTC 


6 0 


AATOCTOOCO OCOGCTCTGO 


TGOTOOTTCT 


GOTOOCOOCT 


CTOAOGOTOG 


TOOCTCTOAO 


1 2 0 


OOTOOCGOTT CTOAOGOTOG 


CGOCTCTGAO 


OOAGOCGOT T 


CCGGTOOTGG 


CTC TOO T TCC 


1 8 0 


OOTGATTTTG ATTATOAAAA 


O ATOGC AA AC 


OCT A AT AAOO 


OGGCTATGAC 


COAAAATOCC 


2 4 0 


OATGAAAACG COCTACAOTC 


TGACGCTAAA 


OOCAAACTTG 


AT TC TGTCGC 


TACTGATT AC 


3 0 0 


GOTOCTOCTA TCOATOOTTT 


CATTGOTGAC 


OTTTCCOOCC 


T TOC T A A TOO 


TAATOOT OCT 


3 6 0 


ACTGOTOATT TTCCTOOCTC 


TAATTCCCAA 


ATOGCTC AAG 


TCOOTOACGO 


TO AT A AT T CA 


4 2 0 


CCTTTAATOA ATAATTTCCO 


TC AAT AT TT A 


CCTTCCCTCC 


CTCA ATCOOT 


TOAATOTCOC 


4 8 0 


CCTTTTOTCT TTAOCOCTGO 


TAAACCATAT 


OAATTTTCTA 


TTOATTOTGA 


CAA AAT AAAC 


5 4 0 


TTATTCOGTO TCTTTOCOTT 


TCTTTTATAT 


OTTOCCACCT 


TTATGTATOT 


ATTTTCT ACQ 


6 0 0 


TTTOCTAACA TACTOCOTAA 


T A AGO AOTCT 


T AAT CATOCC 


AOTTCTTTTO 


OOTATTCCOT 


6 6 0 


T ATT AT 










6 6 6 



< 2 ) INFORMATION FOR SEQ ID NO. 113: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: "TOS ba»e paki 
( B ) TYPE: oucfcac acid 
( C ) STRANDEDNESS: AxiJc 
( D ) TOPOLOGY: fine* 

( i i ) MOLECULE TYPE: DNA (gcacratc) 



( i i i ) HYPOTHETICAL: NO 
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6 0 
I 2 0 

1 S 0 

2 4 0' 



( i v ) ANT! -SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO. 113: 

OAOACOACTA GTGOTOOCOO TOGCTCTCCA TTCOTTTQTG AATATCAAOO CCAAGOCCAA 

TCOTCTOACC TGCCTCAACC TCCTOTCAAT GCTGOCOGCO OCTCTGGTOO TOOTTCTGOT 

GGCOGCTCTG aCOGTOGTGG CTCTOAOOOT GGCGOTTCTG AGOGTOGCOO CTCTGAGGOA 

GOCGOTTCCG OTOCTOOCTC TGGTTCCGGT GATTTTOATT ATOAAAAGAT OOCAAACGCT 

AATAAGOOOG CTATOACCOA AAATOCCGAT CAAAACOCOC TACAGTCTGA COCTAAAGGC 300 

AAACTTOATT CTOTCGCTAC TGATTACGGT OCTGCTATCO ATOGTTTCAT TOOTGACGTT 360 

TCCGGCCTTG CTAATOOTAA TGGTOCTACT GGTOATTTTG CTGGCTCTAA TTCCCAAATG 420 

OCTCAAOTCG GTGACGGTGA TAATTCACCT TTAATGAATA A T T T C C O T C A A T A T T T A C C T 

TCCCTCCCTC AATCGGTTGA ATOTCOCCCT TTTOTCTTTA OCGCTOGTAA ACCATATOAA 

ttttctattg attotgacaa AATAAACTTA ttccotooto tctttgcott tcttttatat 

OTTOCCACCT TTATOTATOT ATTTTCTACG T TTGCTAACA TACTOCGTAA TAAGGAGTCT 
TAATCATOCC AOTTCTTTTG OOTATTCCOT TATTATOCTA GCTAOTAA 

( 2 ) INFORMATION FOR SEQ ID NO:tl4: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 201 base pakf 
< B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: double 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE; DMA (genomic) 

( i i i ) HYPOTHETICAL: NO 

( i y ) ANTI-SENSE: NO 

( I i ) SEQUENCE DESCRIPTION: SEQ ID NO-.U4: 
TATGCT AOCT AOTAACACOA caggtttccc gactggaaag CGOGCAOTOA OCOCAACGCA 
ATTAATOTOA GTTAGCTCaC TCATTAOOCA CCCCAOOCTT TACACTTTAT OCTTCCOGCT 
COTATOTTOT otooaattot GAOCGOATAA CAATTTCACA CAOGAAACAO CTATOACCAT 
OATTACOAAT TCGAGCTCOO T 



4 8 0 

5 4 0 

6 0 0 

6 6 0 

7 0 8 



6 0 
1 2 0 

1 ft 0 

2 0 1 



( 2 ) INFORMATION FOR SEQ IDNO:! 15: 

( i ) SEQUENCE CHARACTERISTICS: 
( A > LENGTH: 830 bas* pak» 
( B >TYPE: oockk acid 
( C ) STRANDEDNESS: doubk 
( D ) TOPOLOGY: Enear 

< i i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

( t » ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 115: 
AOOTCCAOCT TCTCOAGTCT GOACCTOOCC TCOTOAAACC TTCTCAOTCT CTOTCTCTCA 
CCTGCTCTOT CACTOACTAC TCCATCACCA GTGCTTATTA CTOGAACTOO ATCCOOCAGT 
TTCCAOOAAA CAAACTOGAA TOGATOOOCT ACATAAOCTA COACOOTOTC AATAAGTATO 
ATCCATCTCT CAAOAATCOA ATCTCCATCA CTCOTOACAC ATCTAACAAT CAOTTTTTCC 240 
AGAAOTTGAT TTCTGTOACT TCTGAOOACA CAOOAACATA TOACTOTTCA AOAGOOACTA 300 



6 0 
1 2 0 
I 8 0 
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OOOCCTCTGC 


T ATOGACTAC 


TOGGG TC A AG 


G A AT TTCAOT 


CACCOTCTCC 


TCAUtL AAAA 


3 6 0 


CGACACCCCC 


ATCTGTCTAT 


CCACTOOCCC 


C TOO ATC TOC 


TOCCCAAACT 




4 2 0 


TO ACCCTGOO 


ATOCCTGOTC 


AAOOO CT ATT 


TCCCTOAGCC 


AOTOACAOTO ' 




4 8 0 


CTOGATCCCT 


OTCCAOCOOT 


G TOC A CACCT 


TCCC AOCTOT 


CCTOCAOTCT 


O ACCTCT AC A 


5 4 0 


C T CT 0 AG C AO 


CTCAOTOACT 


OTCCCCTCCA 


GCCCT COGCC 


C AOCOAOACC 


otcacctgca 


6 0 0 


ACGTTOCCC A 


CCCOOCCAGC 


AGCACCAAGO 


TOOACAAOAA 


A ATTGTOCCC 


AOGGATTGTA 


6 6 0 


CT AOTOCTO A 


OOGTGACOAT 


CCCOC AAA AO 


COOCCTTTAA 


CTCCCTGCAA 


GCCTCAGCGA 


7 2 0 


CCGAAT AT AT 


C GO T T A TGCO 


TOGOCOATOO 


T TOTTGTCAT 


T OTCOOCOCA 


ACT ATCOGT A 


7 ft 0 


TCAAGCTGTT 


T AAOAAAT TC 


ACCTCGAAAO 


CA AOCTGAT A 


GA ATTCOAGT 




8 3 0 



( 2 ) INFORMATION FDR SBQ ID N0116; 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 260 base pairs 
( B ) TYPE: cockic acid 
( C ) STRANDEDNESS: double 
( D )TOPOU5GY:fiDear 

( i j ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

■ ■ (ii ) ANTI-SENSE: NO 



(i i ) SEQUENCE DESCRIPTION: SEQ ID NO: 11 6: 



ATG AAATACC . 


. TATTGCCTAC 


OGC AGCCOCT 


GOATTOTTAT 


T ACTCGCTGC 


CCAACCAGCC 


6 0 


ATOOCCCAGO 


TO AAACTGCT 


CO AG A T T T CT 


AOACT AOTOC 


TOAOOOTGAC 


GATCCCOC AA 


1 2 0 


AAGCOOCCTT 


TAACTCCCTO 


CA AGCCTCAO 


CGACCOAATA 


T ATCGGTTAT 


GCGTOOOCOA 


1 8 0 


TOGTTOTTOT 


CATTOTCGGC 


GCAAC TATCO 


OTATCAAOC T 


OT TTAAGAAA 


TTCACCTCOA 


2 4 0 


AAOCAAOCTO 


ATAGAATTCG 










2 6 0 



( 2 ) INFORMATION FOR SBQ ID NO.l 17: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 461 b*»* pair* 
{ B ) TYPE: Dpcfcie acid 
( C ) STRANDEDNESS: double 
( D ) TOPOLOOY: tinwr 

( i i ) MOLECULE TYPE: DNA (genomic) 

(lii ) HYPOTHETICAL: NO 

( i t ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 117: 

OTACOCGCCC TGTAOCOOCG CATTAAGCOC GGCOOOTOTG OTOOTTACOC OCAOCGTGAC 

COCTACACTT OCCAGCGCCC TAGCOCCCOC TCCTTTTGCT TTCTTCCCTT CCTTTCTCGC 

CACOTTCGCC GGCTTTCCCC OTCAAOCTCT AAATCGGOOO CTCCCTTTAO OOTTCCOATT 

TAGTOCTTTA COOCACCTCG ACCCCAAAAA ACTTOATTAO OOTOATGOTT CACGTAOTOO 

OCCATCOCCC TGATAGACOO TTTTTCGCCC TTTOACOTTO OAOTCCACOT TCTTTAATAO 

TOOACTCTTG TTCCAAACTG OAACAACACT CAACCCTATC TCOOTCTATT CTTTTOATTT 

ATAAOOOATT TTOCCOATTT COOCCTATTO GTTAAAAAAT OAOCTOATTT AACAAAAATT 

TAACOCOAAT TTTAACAAAA TATTAACOTT TACAATTTAA A 
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{ 2 ) INFORMATION FOR SEQ ID NO: US: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 22 base psir» 
( B ) TYPE: oucletc acid 
{ C ) STRANDEDNESS: Jtngie 
( 0 ) TOPOLOGY: linear 

{ t i ) MOLECULE TYPE: DNA (genomic) 

(Hi ) HYPOTHETICAL: NO 

( i y ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 118: 
OCAATAAACC CTCACTAAAO GO 

( 2 ) INFORMATION FOR SEQ ID NO: 119: 

( i ) SEQUENCE CHARACTERISTICS: 
{ A ) LENGTH: 21 base pain 
( B ) TYPE: oocloc acid 
( C ) STRANDEDNESS: im«k 
( D ) TOPOLOOY: hnear 

< i i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

( i v ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ED NO: 11 9: 
TCTCOCACAA TAATACACOO C 

( 2 ) INFORMATION FOR SEQ ID NO: 120: 

( i > SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 84 base pain 
( 8 ) TYPE: aocioc acid 
( C ) STRANDEDNESS: tingle 
< D ) TOPOLOOY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

( i v ) ANTI-SENSE: NO 

(xL) SEQUENCE DESCRIPTION: SEQ ID NO: 120: 
OTOTATTATT OTOCOAOANN SNNSNNSNNS NNSNNSNNSN NSNNSNNSNN SNNSNNSNNS 
NNSNNSTGGG OCCAAGOOAC CACO 



6 0 
8 4 



( 2 ) INFORMATION FOR SEQ ID NO: 121: 

( i ) SEQUENCE CHARACTERISTICS: 
{ A ) LENGTH: 30 base pair* 
( B ) TYPE: auckic acid 
( C ) STRANDEDNESS: *ingk 
' ( D ) TOPOLOGY: Hoear 

( i i ) MOLECULE TYPE: DNA (£«i«nic) 

(ill ) HYPOTHETICAL: NO 

( i t ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SBQ ID NO:I21: 
OCATOTACTA OTTTTOTCAC AAGAT TTGOO 



{ 2 ) INfDRMATION FOR SEQ ID NO: 122: 
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( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: IS base pain 
( B ) TYPE: rxiciac acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 
(tit ) HYPOTHETICAL: NO 

( i ▼ ) ANTI-SENSE: NO 
' { x i ) SEQUENCE DESCRIPTION: SEQ ID NO:122: 
TOGOGCCAAG GOACCACG 



( 2 ) INFORMATION FOR SEQ ID NO-.123: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 18 base pairs 
( B )TYPE: mickic acid 
( C ) STRANDEDNESS: wajJc 
< D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i i i ) HYPOTHETICAL: NO 

o ( i ▼ ) ANTI-SENSE: NO 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NOM23: 

GTGTATTATT GTOCOAGA 



( 2 ) INFORMATION FOR SEQ ID NO-.124: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 39 haw pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: smgfc 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (genoxmc) 

( i i i ) HYPOTHETICAL: NO 

( i v ) ANTI-SENSE: NO 

( i x ) FEATURE: 

( A ) NAME/KEY: rcpa*_regk» 
( B ) LOCATION: 19.-21 

( D ) OTHER INFORMATION: Apt_rype^ta»dan- 

/ mjucTNNS can be repeated from 3 to about 24 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:124: 
GTGTATTATT GTGCGAOANN STOOGGCCAA OOGACCACO 



( 2 ) INFORMATION FOR SEQ ID NO.125: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 39 base pain 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: fiaear 

( i i ) MOLECULE TYPE: DNA (geoccric) 

( i i i ) HYPOTHETICAL: NO 

( i * ) ANTI-SENSE: NO 

( i i ) FEATURE: 

( A ) NAME/KEY: repeat_J*g*e« 
( B ) LOCATION: 19.31 

( D ) OTHER INFORMATION: /rpcjypes^andem** 
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/ oott^-NNK can be repeated from 3 to about 24 

( * i ) SEQUENCE DESCRIPTION; SEQ ID NO: 125: 
OTGTATTATT OTCCGACaNN KTGGGCCCAA OOOACCACO 



What is claimed is: 10 
1. A method for producing a library of complementarity 
deterirjining region (CDR>mutagenized phage-displayed 
immunoglobulin heterodimers. the method comprising the 
steps of: 

1) amplifying a CDR portion of a template immunoglo- i 5 
bulin variable domain gene selected from the group 
consisting of a template immunoglobulin heavy chain 
variable domain gene and a template immunoglobulin 
light chain variable domain gene, wherein said tem- 
plate immunoglobulin heavy and light chain genes M 
have a framework region and said CDR portion and 
encode respective heavy and light chain variable 
domain polypeptides, and wherein said amplifying is 
by polymerase chain reaction (PCR) using a PCR 
primer oligonucleotide for mutageoizing a preselected 25 
nucleotide region in said CDR portion, thereby forming 

a library of amplified CDR-mutagenized immunoglo- 
bulin gene fragments, said PCR primer oligonucleotide 
having 3 f and 5' tennini and comprising: 

a) a nucleotide sequence at the 3' terminus capable of ^ 
hybridizing to a first framework region of said 
selected template immunoglobulin variable domain 
gene; 

b) a nucleotide sequence at the 5' terminus capable of 
hybridizing to a second framework region of said 35 
selected template immunoglobulin variable domain 
gene; and 

c) a nucleotide sequence between the 3* and 5' termini 
according to the formula selected from the group 
consisting of: 40 

[NNSI„ and [NNKU 
wherein N is independently any nucleotide. S is G or C and 
K is G or T, and n is 3 to about 24. the 3* and 5 T terminal 
nucleotide sequences having a length of about 6 to 50 45 
nucleotides, or an oligonucleotide having a sequence 
complementary thereto; 

2) inserting individual members of the library of amplified 
CDR-mutagenized immunoglobulin gene fragments 
formed in step (1) into a dicistronic phagemid expres- 50 
sion vector comprising immunoglobulin heavy and 
light chain variable domain genes that lack the immu- 
noglobulin gene portion corresponding to the fragment 

to be inserted, wherein upon insertion said vector is 
capable of expressing heavy and light chain variable 55 
domain polypeptides encoded by said vector, thereby 
forming a library of dicistronic expression vectors 
containing amplified CDR-mutagenized immunoglo- 
bulin gene fragments; and 

3) expressing said irnraunoglobulin heavy and light chain 60 
genes in the library of dicistronic expression vectors 
formed in step (2) whereby said encoded heavy and 
light chain variable domain polypeptides assemble on 
the surface of a phage to form a phage-displayed 
immunoglobulin heterodimer. thereby producing a 63 
library of CDR-mutagenized phage-displayed immu- 
noglobulin heterodimers. 



2. The method of claim 1 wherein said 3' terminus has the 
nucleotide sequence 5'-TGGGGCCAAGGGACCACG-3* 
(SEQ ID NO 122). or an oligonucleotide having a sequence 
complementary thereto. 

3. The method of claim 1 wherein said 5' terminus has the 
nucleotide sequence 5^GTGTATTATTGTGCGAGA-3' 
(SEQ ID NO 123) or an oligonucleotide having a sequence 
complementary thereto. 

4. The method of claim 1 wherein said template immu- 
noglobulin heavy and light chain genes are obtained from a 
human. 

5. The method of claim 1 wherein said CDR portion is 
CDR3. 

6. The method of claim 1 wherein said formula is: 

5*-GTOTATTATTGTGCGAGA|NT^S] n TGGGGCCAAGGGAC - 
CACG-3" (SEQ ID NO 124> 

7. The method of claim 1 wherein n is 16 in the formula 
|NNS]„ (SEQ ID NO 120). 

8. The method of claim 1 wherein said formula is: 

S'-^nXrrATrArTGTCCGAGAfN^ 
CACG-J (SEQ ID NO 125). 

9. A method for producing a complementarity determin- 
ing region (CDR)-mutagenized phage-displayed immuno- 
globulin heterodimer with altered antigen binding 
specificity, the method comprising the steps of: 

1) amplifying a CDR portion of a template immunoglo- 
bulin variable domain gene selected from the group 
consisting of a template immunoglobulin heavy chain 
variable domain gene and a template immunoglobulin 
light chain variable domain gene, wherein said tem- 
plate immunoglobulin heavy and light chain genes 
have a framework region and said CDR portion and 
encode respective heavy and light chain variable 
domain polypeptides having a preselected antigen 
binding specificity to a first antigen, and wherein said 
amplifying is by polymerase chain reaction (PCR) 
using a PCR primer oligonucleotide for mutagenizing a 
preselected nucleotide region in said CDR portion to 
alter said preselected antigen binding specificity, 
thereby forming a library of amplified CDR- 
mutagenized immunoglobulin gene fragments, said 
PCR primer oligonucleotide having 3' and 5' termini 
and comprising: 

a) a nucleotide sequence at the 3* terminus capable of 
hybridizing to a first framework region of said 
selected template immunoglobulin variable domain 
gene; 

b) a nucleotide sequence at the 5' terminus capable of 
hybridizing to a second framework region of said 
selected template immunoglobulin variable domain 
gene; and 

c) a nucleotide sequence between the 3' and 5' termini 
according to the formula selected from the group 
consisting of: 

fNNSL aid [NNKU 
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wherein N is independently any nucleotide. S is G or C and 
K is G or T. and d is 3 to about 24. the 3* and 5' terminal 
nucleotide sequences having a length of about 6 to 50 
nucleotides, or an oligonucleotide having a sequence 
complementary thereto; 5 

2) inserting individual members of the library of amplified 
CDR-mutagenized immunoglobulin gene fragments 
formed in step (1) into a dicistronic phagemid expres- 
sion vector comprising immunoglobulin heavy and 
light chain variable domain genes that lack the immu- 10 
noglobulin gene portion corresponding to the fragment 

to be inserted, wherein upon insertion said vector is 
capable of expressing heavy and light chain variable 
domain polypeptides encoded by said vector, thereby 
forming a library of dicistronic expression vectors 15 
containing amplified CDR-mutagenized immunoglo- 
bulin gene fragments; 

3) expressing said immunoglobulin heavy and light chain 
genes in the library of dicistronic expression vectors 
formed in step (2) whereby said encoded heavy and 
light chain variable domain polypeptides assemble on 
the surface of a phage to form a phage-displayed 
immunoglobulin heterodimer, thereby producing a 
library of CDR-mutagenized phage-displayed immu- ^ 
noglobulin heterodimers; and 

4) irnmunoreacting members of the library of CDR- 
mutagenized phage-displayed immunoglobulin het- 
erodimers formed in step (3) on a preselected second 
antigen, said second antigen being different than said ^ 
first antigen to allow for selection of a CDR- 
mutagenized phage-displayed immunoglobulin het- 
erodimer with altered antigen binding specificity. 

10. The method of claim 9 wherein said 3* terminus has 
the nucleotide sequence 5 ? -TGGGGCCAAGGGACCACG- 35 
3' (SEQ ID NO 122). or an oligonucleotide having a 
sequence complementary thereto. 

11. The method of claim 9 wherein said 5 ? terminus has 
the nucleotide sequence 5 f -GTGTATTArTGTGCGAGA-3' 
(SEQ ID NO 123) or an oligonucleotide having a sequence ^ 
complementary thereto. 

12. The method of claim 9 wherein said template immu- 
noglobulin heavy and light chain genes are obtained from a 
human. 

13. The method of claim 9 wherein said CDR portion is 45 
CDR3. 

14. The method of claim 9 wherein said formula is 
y^jitjiAnAnrjrocGAGArNNSU 

CACG-r (SEQ ID NO 124). 

50 

15. The method of claim 9 wherein n is 16in the formula 
[NNS]„ (SEQ ID NO 120). 

16. The method of claim 9 wherein said formula is: 

y^ntrrxnxiTCKXGAGAp^ 

CACG-3* (SEQ ID NO 125). 55 

17. A method for producing a soluble complementarity 
determining region (CDR>rautagenized immunoglobulin 
heterodimer with altered immunorcactivity to a preselected 
antigen, the method comprising the steps of: 60 

1) amplifying a CDR portion of a template immunoglo- 
bulin variable domain gene selected from the group 
consisting of a template immunoglobulin heavy chain 
variable domain gene and a template immunoglobulin 
light chain variable domain gene, wherein said tern- 65 
plate immunoglobulin heavy and light chain genes 
have a framework region and said CDR portion and 
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encode respective heavy and light chain variable 
domain polypeptides immunoreactive with a prese- 
lected antigen, and wherein said amplifying is by 
polymerase chain reaction (PCR) using a PCR primer 
oligonucleotide for mutagenizing a preselected nucle- 
otide region in said CDR portion to alter said immu- 
noreactiviry of said immunoglobulin heterodimer to the 
preselected antigen, thereby forming a library of ampli- 
fied CDR-mutagenized immunoglobulin gene 
fragments, said PCR primer oligonucleotide having 3* 
and 5' termini and comprising: 

a) a nucleotide sequence at the 3* terminus capable of 
hybridizing to a first framework region of said 
selected template immunoglobulin variable domain 
gene; 

b) a nucleotide sequence at the 5' terminus capable of 
hybridizing to a second framework region of said 
selected template immunoglobulin variable domain 
gene; and 

c) a nucleotide sequence between the 3' and 5* termini 
according to the formula selected from the group 
consisting of: 

(NNSU and (NNX^. 

wherein N is independently any nucleotide. S is G or C and 
K is G or T, and n is 3 to about 24. the 3' and 5' terminal 
nucleotide sequences having a length of about 6 to 50 
nucleotides, or an oligonucleotide having a sequence 
complementary thereto; 

2) inserting individual members of the library of amplified 
CDR-mutagenized immunoglobulin gene fragments 
formed in step (1) into a dicistronic phagemid expres- 
sion vector comprising immunoglobulin heavy and 
light chain variable domain genes that lack the immu- 
noglobulin gene portion corresponding to the fragment 
to be inserted, wherein upon insertion said vector is 
capable of expressing heavy and light chain variable 
domain polypeptides encoded by said vector, thereby 
forming a library of dicistronic expression vectors 
containing amplified CDR-mutagenized immunoglo- 
bulin gene fragments; 

3) expressing said immunoglobulin heavy and light chain 
genes in the library of dicistronic expression vectors 
formed in step (2) whereby said encoded heavy and 
light chain variable domain polypeptides assemble on 
the surface of a phage to form a phage-displayed 
immunoglobulin heterodimer. thereby producing a 
library of CDR-mutagenized phage-displayed immu- 
noglobulin heterodimers; 

4) irnmunoreacting members of the library of CDR- 
mutagenized phage-displayed immunoglobulin het- 
erodimers produced in step (5) on said preselected 
antigen to isolate an immunoglobulin heterodimer hav- 
ing altered immunoreactivity; 

5) isolating said immunoreacted CDR-mutagenized 
phage-displayed immunoglobulin heterodimer 
obtained in step (4); 

6) producing a soluble form of said immunoreacted form 
of immunoreacted CDR-mutagenized phage-displayed 
immunoglobulin hetexodiroer isolated in step (5); and 

7) assaying said soluble form of immunoreacted CDR- 
mutagenized immunoglobulin heterodimer prepared in 
step (6) to identify a CDR-mutagenized immunoglo- 
bulin heterodimer with altered immunoreactivity to the 
preselected antigen. 

18. The method of claim 17 wherein said assaying is 
determined by an increase in affinity to the preselected 
antigen. 
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19. The method of claim 18 wherein the affinity is greater 
than 10*M~ 1 dissociation constant (KJ. 

20. The method of claim 17 wherein said 3* terminus has 
the nucleotide sequence 5*-TGGC3GCCAAGGGACCACG- 
y (SEQ ID NO 122). or an oligonucleotide having a 
sequence complementary thereto. 

21. The method of claim 17 wherein said 5 f terminus has 
the nucleotide sequence 5'-GTGTATTATTGTGCGAGA-3' 
(SEQ ID NO 123) or an oligonucleotide having a sequence 
complementary thereto. 

22. The method of claim 17 wherein said template immu- 
noglobulin heavy and light chain genes are obtained from a 
human. 
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23. The method of claim 17 wherein said CDR portion is 
CDR3. 

24. The method of claim 17 wherein said formula is 

5'-GTGTXnATTGTGCGAGA|NNS |„TGGGGCCAAGGGAC- 
CACG-3' (SEQ ID NO 124> 

25. The method of claim 17 wherein n is 16in the formula 
iNNS]„ (SEQ ID NO 120). 

26. The method of claim 17 wherein said formula is: 

S'-GTTGTATTXITCnXX^G AG A(NNK ] W TGGGGCCAAGGG 
CACG-3" (SEQ ID NO 125). 

***** 
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ABSTRACT T-cell receptors (TCRs) are membrane an- 
chored heterodimers structurally related to antibody mole- 
cules. Single-chain antibodies can be engineered by linking the 
two variable domains, which fold properly by themselves. 
However, proper assembly of the variable domains of a human 
TCR (V a and V p ) that recognize the HLA-DR2b/myelin basic 
protein-(85-99) peptide complex was critically dependent on 
the addition of a third domain, the constant region of the TCR 
P chain (Cp), to the single-chain construct. Single-chain mol- 
ecules with the three-domain design, but not those with the 
two-domain design, expressed in a eukaryotic cell as chimeric 
molecules linked either to glycosyl phosphatidylinositol or to 
the transmembrane/cytoplasmic domains of the CD3 f chain 
were recognized by a conformation-sensitive monoclonal anti- 
body. The chimeric three-domain single-chain TCR linked to 
CD3 f chain signaled in response to both the specific HLA- 
DR/peptide and the HLA-DR/superantigen staphylococcal 
enterotoxin B complexes. Thus, by using this three-domain 
- design, functional single-chain TCR molecules were expressed 
with high efficiency. The lipid-linked single-chain TCR was 
solubilized by enzymatic cleavage and purified by affinity 
chromatography. The apparent requirement of the constant 
domain for cooperative folding of the two TCR variable 
domains may reflect significant structural differences between 
TCR and antibody molecules. 



T-cell receptor (TCR) recognition of antigen fragments pre- 
sented by major histocompatibility complex (MHC) mole- 
cules is a critical step in the initiation of a specific immune 
response (1, 2). The TCR a and 0 chains are each composed 
of two immunoglobulin-like domains; most of the amino acid 
residues that are found to be highly conserved in the variable 
(V) region of immunoglobulins are also found in TCR V 
regions, suggesting that the tertiary structure of the TCR may 
resemble that of immunoglobulins (3,4). However, TCR V 
regions have significantly more primary sequence variability, 
an increased apparent rate of divergence in phylogeny, and 
peaks of variability in addition to those rioted in immuno- 
globulins (5, 6). In order to understand and control the 
molecular interactions underlying T-cell recognition of 
MHC/peptide complexes; complete structural knowledge of 
the TCR is required. 

Several approaches have been employed to produce sol- 
uble, recombinant TCRs. In these recombinant TCR mole- 
cules, the transmembrane/cytoplasmic regions of a and p 
chains were replaced with sequences from lipid-linked pro- 
teins (7), the CD3 f chain (8), or immunoglobulins (9, 10). 
~ Soluble TCRs were either recovered as secreted proteins or 
obtained by enzymatic cleavage of the surface-expressed 
recombinant proteins. All of these approaches rely, however, 
on the assembly of the heterodimer, which is inefficient (11). 
In addition, high-level expression of the human TCR a chain 
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in transfected eukaryotic cells is not stable. These problems 
can be avoided by the design of a single-chain (sc) recombi- 
nant protein in which the V regions of the heterodimer are 
joined by a short peptide linker. Such a design has been 
successfully applied to antibody molecules (12). Such recom- 
binant molecules, scFv, have a specificity and affinity similar 
to that of native antibodies (12). Several reports have de- 
scribed the production of scTCRs in bacterial expression 
systems using the sc antibody (Fv) design (13-16), but none 
have presented functional data indicating that these scTCRs 
could recognize their MHC/peptide complexes or superan- 
tigens. Recently, however, the production in bacteria of a 
scTCR with the two-domain Fv design that could recognize 
its natural ligands has been reported, although the fraction 
correctly refolded was extremely low (17). 

In the present report, different scTCR designs were eval- 
uated in transfected eukaryotic cells with respect to surface 
expression of TCR molecules, proper folding, and recogni- 
tion of the appropriate MHC/peptide ligand. A three-domain 
sc construct [a-chain V (VoHinker-j3-chain V (V^jS-chain 
constant (C^)] was stably expressed on the cell surface when 
linked to a glycosyl phosphatidylinositol (GPI) anchor and 
recognized by a conformation-dependent monoclonal anti- 
body (mAb) specific for the V^17 segment. The soluble form 
of this recombinant protein could be readily obtained by 
enzymatic cleavage with phosphatidylinositol-speciftc phos- 
pholipase C (PI-PLC). Replacement of the GPI domain with 
the cytoplasmic portion of the £ chain resulted in a functional 
TCR molecule that transduced an intracellular signal follow- 
ing recognition of either the proper MHC/peptide or the 
MHC/staphylococcal enterotoxin B (SEB) superantigen 
complex. The production of a functional scTCR directly 
demonstrates the feasibility of employing sc design to pro- 
duce soluble TCRs. 

MATERIALS AND METHODS 

Construction of Recombinant TCR Molecules. cDNAs of 
TCR a and /? chains were prepared from mRNA of Hy.2H9 
cells (18) with Superscript reverse transcriptase (BRL) and an 
oligo(dT) primer (Sigma) and were amplified by PCR using 
Vent DNA polymerase (New England Biolabs) and primers 
5'-GCTCGAGGCGGCGATGGAAACTCTCCTGGGAGT-3' 
(A5) and 5'-GGAATTCAGCTGGACCACAGCCGC-3' for 
a-chain and 5'-GCTCGAGCTCTGCCATGGACTCCTG- 
GA-3' and 5 ' -GG AATTC AG AA ATCCTTTCTCTTG AC-3 ' 
for j3-chain. The cDNAs were cloned into the mammalian 
expression vector pBJ-neo (8). GPI-anchored TCR molecules 
(a-PI and 0-PI) were constructed as follows. A Ban I site was 



Abbreviations: TCR, T-cell receptor; MHC, major histocompatibil- 
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constant; IL-2, interleukin 2; mAb, monoclonal antibody. 
^Present address: Cystar, Inc., 30 Commerce Way, Woburn, MA 
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engineered after the fifth amino acid residue beyond the last 
cysteine by oligonucleotide-directed mutagenesis. The region 
3' of the Ban I site was then replaced with a Ban 1-Not I 
fragment encoding the GPI signal domain from the human 
placental alkaline phosphatase. For the construction of vari- 
ous scTCRs, variable domains of the 2H9 TCR a and p chains 
were prepared by PCR using primers A5 and 5'-CAGAGCT- 
C ACGGATG AAC A ATAAGGCTGGT-3 ' for the V 0 domain 
in all the scTCR constructs, 5'-TCGGATATCGATGGTG- 
GAATCACTCAGTCC-3' (B5) and 5 - -C AG AG ATC AG- 
CACGGTGAGCCGGTTCCC-3' for the domain in AB- 
PI-1, 5 '-GTGGG AG ATCTCTGCTTCTG ATGGCTC AAAC 
and B5 for the V„ domain in AB-PI-2, 5'-GACGGATC- 
CCCGTCTGCTCTACCCCAGGC and B5 for the V„ and 
domains in ABC-PI, and 5'-CACGGATCCCCGTCTGCTC- 
TACCCCAGGC-3' and B5 for the and C 0 domains in 
ABC-f. The cDNA encoding the transmembrane and cyto- 
plasmic domains of murine CD3 ( chain (8) was a gift of R. D. 
Klausner (National Institutes of Health). Convenient restric- 
tion sites were engineered at the end of each fragment to aid 
in the assembly of the construct. The linker was a 15-amino 
acid motif of GGGGS repeated three times (12) with Sac I at 
the 5' end and EcoRV at the 3' end. Except for <*- PI, all the 
constructs were cloned into pBJ-neo, which carries the G418- 
resistance gene. a-PI was cloned into pCEP-4 (Invitrogen), 
which bears the hygromycin-resistance gene. All constructs 
were verified by multiple restriction digests and by sequencing 
with the Sequenase kit (United States Biochemical). 

Affinity Purification and Characterization of a Soluble 
Three-Domain scTCR. After transfection and G418 selection 
(8), cells expressing a high level of GPMinked three-domain 
scTCR (ABC-PI) were isolated by three rounds of sorting. 
The resulting cells were grown in spinner culture to a density 
of 10 6 per ml and harvested by centrifugation. The pellet was 
washed twice with phosphate-buffered saline (PBS) and 
resuspended in PBS containing 2 mM Pefabloc (Centerchem, 
Stamford, CT) to a density of 5 x 10 7 per ml with PI-PLC 
(Sigma) added at 1 unit/ml. Cells were incubated at 37°C for 

1 hr with constant rocking. The supernatant was collected by 
centrifugation and by passage through a 0.45- fim filter and 
applied to a column of Acti-gel (Sterogen, Arcadia, CA) with 
immobilized /3F1. The column was washed with 10 volumes 
of PBS and the soluble TCR was eluted with 0.15 M glycine 
(pH 2.8). Fractions were immediately neutralized with 0.1 
volume of saturated Tris. The soluble TCR was then dialyzed 
against > 100 volumes of PBS at 4°C with at least four changes 
and concentrated to 0.5 rag/ml by vacuum dialysis against 
PBS. Five micrograms of purified soluble three-domain 
scTCR was analyzed by SDS/PAGE under reducing condi- 
tions. 

Stimulation of Transfectants with Antibodies, SEB, and 
Peptide/MHC Ligands, ABC-£-transfected B.W5147j3 a~p~ 
(19) cells (5 x 10 4 per well) were cultured in a 96-well 
round-bottom plate to which various antibodies had been 
immobilized (1 fxg per well). The supernatants were collected 
after 24 hr and interleukin 2 (IL-2) production was assessed 
in a bioassay using an IL-2-dependent cell line (CTLL) and 
the CellTiter-96 nonradioactive proliferation assay 
(Promega). In the case of ABC-f-transfected RBL-2H3 (8) 
cells, the cells were incubated with pHJserotonin (NEN) at 
0.5 /tCi (18.5 kBq) for 24 hr before they were added to the 
antibody plate. After incubation at 37°C for 2 hr, radioactivity 
released into the supernatant was measured in a liquid 
scintillation counter. The specific serotonin release was 
calculated as described (8). For SEB stimulation, 5 x 10 4 
transfected cells per well were cultured with various con- 
centrations of SEB (Toxin Technology, Sarasota, FL) in the 
presence or absence of 2 x 10 5 B cells. For antigen presen- 
tation, 5 x 10 4 transfected cells per well were cocultured with 

2 x 10 3 B cells which were incubated with or without the 



myelin basic protein (MBPM85-99) peptide for 3 hr before 
the experiment. The assays were conducted as described 
above. 

RESULTS 

mAb CI Recognizes a Conformational Epitope of TCR. 

Recombinant TCR molecules were generated by using the 
TCR a- and £-chain sequences of the human MBP-specific 
T-cell clone Hy.2H9 (18). This clone TCR is composed of the 
Vo3.1 and V p 17.1 segments and is specific for the immuno- 
dominant MBP peptide MBP-(85-99) in the context of HLA- 
DR2 (DRA, DRB1*16Q2) (18). Usage of the V„17 J segment 
allowed the proper folding of recombinant TCRs to be probed 
with the superantigen SEB (20) and the mAb CI (21). To 
confirm the Vpl7 specificity of CI, the extracellular domains 
of TCR a and p chains of Hy.2H9 cells were fused to the 
C-terminal sequence from human placental alkaline phospha- 
tase for GPI anchorage (Fig. 1) and the DNAs encoding the 
GPI-anchored p and a chains (0-PI and ot-PI) were sequen- 
tially transfected by electroporation (8) into a TCR a- and 
£-chain-deficient murine lymphoma cell line, BW5147 a~p~ 
(BW~) (19). The surface expression of the GPI-anchored 
TCR'chains was monitored by staining with mAbs aFl (22), 
PFl (23), and CI. aFl and pFl recognize nonconformational 
epitopes located in the C region of the TCR a and p chains, 
respectively. Surface expression of the GPI-anchored TCR j8 
chain is independent of heterodimer formation and assembly 
of the CD3 complex (7). In the /3-PI-transfected cells (Fig. 2, 
open curves), high-level expression of 0-PI was confirmed by 
staining with 0F1. Interestingly, there was little CI staining 
of these transfectants. However, when a GPI-anchored 2H9 
a chain was supertransfected into the 0-PI transfectant (Fig. 
2, shaded curves), CI reactivity was greatly increased while 
the level of /3F1 staining remained constant. Thus, the 
V^17-specificCl epitope is conformational and dependent on 
the proper pairing of TCR a and p chains and can therefore 
be used to assess the proper folding of recombinant TCRs 
bearing a Vpl7 sequence. 
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Fig. 1. Schematic representation of TCR <*- and 0-chain genes 
and various chimeric constructs. S — S, disulfide bond; L, leader; V, 
variable segment; J, joining segment; C, constant region; TM, 
transmembrane region; Cy, cytoplasmic region; ATG, start codon; 
Li, 15-residue peptide linker containing three repeats of GGGGS; PI, 
GPI domain of human placental alkaline phosphatase with the 
sequence LAPPAGTTDAAHPGRSWPALLPLLAGTLLLL (7). 
The £ region contains transmembrane and cytoplasmic domains of 
the murine CD3 { chain starting at position 31 (8). 
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Fig. 2. Cocxpression of a-FI and 0-PI arc required for CI 
reactivity. Flow cytomeric analysis of 0-PI-transfected BW~ cells 
stained with mAb aFl, 0F1, or CI before (open curves) and after 
(shaded curves) the supertransfection of the oc-PI construct. 

High-level expression of the TCR a chain (a-PI) was, 
however, not stable either alone or in the presence of 0-PL 
Attempts were made on several cell lines, including COS-7, 
CHO-K1, and a TCR-deficient variant of Jurkat cells, JK-p~ 
(J.RT3-T3.5, American Type Culture Collection). The ex- 
pression level of a-PI was comparable to that of 0-PI after the 
initial drug selection, but continued culture for less than a 
month yielded a population of cells with little surface ex- 
pression of a-PI, whereas /3-PI expression was stable (data 
not shown). The inability to obtain cell lines with stable 
high-level expression of the Pl-anchored human TCR a-chain 
has been reported by other laboratories as well (24). 

Expression and Purification of a Three-Domain scTCR. To 
overcome the limitations set by the unstable expression of the 
human TCR a chain, various sc designs were examined. 
Initially, a design similar to that of sc antibodies (Fv) was 
chosen (12). A 15-residue flexible linker was used to link the 
C terminus of the V^Ja domain to the N terminus of the p 
chain. The GPI domain was then iigated to the C terminus of 
the V^-Jp domain. The construct (AB-PI-1, Fig. 1) was 
transfected into several cell lines, including JK-0", COS-7, 
CHO-K1, and BW~. Although the expression of the gene was 
confirmed by the detection of the correct RNA transcripts 
(Fig. 3B), no surface expression was detected, as evidenced 
by negative CI antibody staining (Fig. 3A). Immunoprecip- 
itation after metabolic labeling failed to recover any Cl- 
reactive sc molecules from these transfectants (data not 
shown). The inability to identify any Cl-reactive protein 
could have been due to the design of this molecule, such as 
insufficient linker length between the extracellular domain 
and the GPI domain. To improve the accessibility of the sc 
construct, another two-domain scTCR was designed in which 
an extra 30-amino acid portion of the N terminus of the C$ 
domain was added as a hinge region. The transfectants of this 
construct (AB-PI-2, Fig. 1) were still not reactive with the CI 
antibody (data not shown). Finally, the entire C^ domain was 
added to the sc construct. A complete C^ domain should 
provide enough distance for the V^-Vp domains to be ex- 
pressed on the cell surface and, more importantly, should 
allow surface expression to be monitored with another anti- 
body, 0F1 (23). This three-domain scTCR was constructed 
by extending the TCR £-chain sequences to the residue right 
before the last cysteine (the sixth cysteine), which was then 
fused to the GPI domain. The last cysteine was deleted to 
prevent dimerization between C^ domains. This construct 
(ABC-PI, Fig. 1) was transfected into B W~ cells and surface 
expression was confirmed by staining with both £F1 and CI 
(Fig. 3C, shaded curves). Both antibodies stained the cells 
with comparable efficiency, suggesting that most of the 
molecules were expressed in the correct conformation. 
Moreover, the molecule could be efficiently cleaved from the 
cell surface with PI-PLC (Fig. 3C, open curves). 

Soluble three-domain scTCR was purified from transfec- 
tants after PI-PLC cleavage followed by affinity chromatog- 
raphy using the 0F1 antibody. The purified three-domain 
scTCR appeared as multiple bands at 50-70 kDa after SDS/ 
PAGE (Fig. 3D). The heterogeneity of scTCR is probably the 
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r Fig. 3. Flow cytomeric analysis of surface expression of scTCR 
constructs. (A) Lack of CI reactivity in BW" cells transfected with 
two-domain scTCR (AB-PI-1). (B) RNA analysis of poIy(A)-enriched 
total cell RNA isolated from BW" cells transfected with two-domain 
and three-domain scTCR constructs AB-PM and ABC-PI, respec- 
tively. Samples were analyzed in a nuclease SI protection assay (25) 
using probes specific for the. 5' end of the transcripts from the TCR - 
constructs. The coexpressed heo (G418-resistance gene) transcripts 
from the vector were analyzed with a probe at the same time as a 
control. (C) Flow cytometric analysis of ABC-PI-transfected BW" 
cells with both 0F1 and CI antibodies before (shaded curves) and 
after (open curves) PI-PLC treatment, (£>) SDS /PAGE of affinity- 
purified three-domain scTCR. (E) Comparison of CI reactivity of 
three-domain scTCRs produced from eukaryotic (Euk) and prokary- 
otic (Prok) expression systems in a two-anubody EUSA. A plateau 
is reached because the amount of £F1 attached to the plate became 
limiting. 

result of variable glycosylation; its polypeptide size calcu- 
lated from amino acid composition is 40 kDa. The structural 
integrity of the three-domain scTCR was verified by a two- 
antibody ELISA (Fig. 3£). The molecules were first captured 
by the fiFl antibody immobilized to the plate and then 
assessed for reactivity with the CI antibody. When compared 
with the three-domain scTCR produced in a bacterial expres- 
sion system (unpublished work), the scTCR from the eukary- 
otic system gave 10-20 times higher CI reactivity. The 
purified three-domain scTCR was stable and could be stored 
in PBS at 4°C for months without significant loss of CI 
reactivity. 

Functional Characterization of a Chimeric Three-Domain 
scTCR. To directly assess the functional integrity of the 
three-domain scTCR, a self-signaling scTCR was produced 
by replacing: the GPI domain with the transmembrane and 
cytoplasmic domains of the CD3 ( chain. These regions have 
been shown to be sufficient for signal transduction when its 
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extracellular fusion partner is crosslinked by an antibody or 
by the proper ligand (8, 26, 27). To enable the recovery of 
three-domain scTCR as a soluble form, a linker containing a 
thrombin cleavage site was inserted into the junction of 
three-domain scTCR and the f domain. The construct 
(ABC-0 was transfected into BW~ cells (28) and the rat 
basophilic leukemia cell line RBL-2H3 (RBL) (8), and the 
populations displaying high-level expression of three-domain 
scTCR were isolated by three rounds of cytofluorometric 
sorting using the antibody j3FL The ABC- f- transfected cells 
were first stimulated with various antibodies to confirm the 
self-signaling nature of this recombinant molecule. The signal 
transduced upon the activation of the three-domain scTCR 
was measured as IL-2 production in BW" transfectants, 
whereas serotonin release was measured in RBL transfec- 
tants. Both transfectants showed a strong response following 
0F1 and CI stimulation but not to purified mouse immuno- 
globulin or anti-CD8 antibody used as controls (Fig. 4 A and 
B). The structural integrity of the scTCR was further exam- 
ined with the superantigen SEB, which binds to both V^17 



and MHC class II molecules, resulting in TCR crosslinking 
and T-cell activation regardless of the peptide bound to the 
MHC molecule (20, 29). ABC-f transfectants displayed a 
concentration-dependent response toward SEB (Fig. 4C) 
when the superantigen was presented by transformed B-cell 
lines with high-level expression of DR1 (DRA, DRB1*0101; 
cell line LG2) or DR2 (DRA, DRB1*1602; cell line 9016). 
Thus, the lateral face of the TCR V> region to which SEB is 
thought to bind (30) is structurally intact. 

to prove that the three-domain scTCR did indeed recog- 
nize the MHC/peptide ligand, antigen presentation experi- 
ments using the natural ligand for the Hy.2H9 clone, 9016 
cells bearing the DRB1*1602 allele of DR2, and MBP-(85-99) 
peptide were performed. To ensure detection of subtle ab- 
normalities in the structure of the three-domain scTCR, 9009 
cells (DRA, DRB 1*1601), which also bind MBP-(85-99), 
were used as a control. DRB1*1601 and DRB1H602 differ 
only at position 67 in the DR^l domain; this TCR contact- 
residue substitution does, however, abolish recognition of 
the peptide by the parent T-cell clone (ref. 18; K.W.W., 
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Fig. 4. (A) [ 3 H]Serotonin release following TCR crosslinking of three-domain scTCR-£ chimeric construct (ABC-0-transfected RBL cells. 
CI is specific to Vsl7 and /3F1 is specific to the C region of TCR 0 chain, mig, mouse immunoglobulin. (B) IL-2 release following TCR crosslinking 
of ABC-f-transfected BW" cells. (C) [ 3 H]Serotonin release following SEB stimulation of ABC-f-transfected RBL cells. (£>) MHC-restncted 
antigen-specific response from the ABC- transfected BW~ ceils. (E) Dose-response curve of MHC/peptide recognition of ABC-f-transfected 
RBL cells. 
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unpublished work). The ABC-£-transfected BW" cells se- 
creted 1L-2 in response to peptide-pulsed 9016 cells, but not 
to" peptide-pulsed 9009 cells (Fig. 4D). Similar results were 
obtained with RBL transfectants (Fig. 4£), as serotonin 
release was dependent on the concentration of the MBP 
peptide used to pulse 9016 cells. The signal appeared to be 
weak when compared with antibody stimulation. This is not 
surprising, however, since saturating amounts of antibodies 
are expected to crosslink the majority of TCR molecules on 
the target cell, whereas a much smaller fraction of TCR 
molecules is probably engaged when T cells are cocultured 
with peptide-pulsed antigen-presenting cells, in which a max- 
imum of 5-15% of the DR molecules bind the peptide. The 
requirement for a high concentration of peptide or SEB is not 
due to the sc design, since high concentrations of peptide are 
also needed to stimulate ap heterodimers of the TCR-f 
constructs (8). It is likely that the decrease in sensitivity 
results from the lack of CD3, CD4, and/or other adhesion/ 
signaling molecules. Nonetheless, these results demonstrate 
that the three-domain scTCR was correctly folded and func- 
tionally competent. In addition, a soluble form of three- 
domain scTCR could be obtained from the ABC-£ transfec- 
tants by thrombin cleavage and affinity purification (data not 
shown). 

DISCUSSION 

A scTCR molecule was designed which contains the V 
domains of both a and )3 chains and the C domain of the 
chain. This scTCR molecule could be stably expressed at a 
high level in eukaryotic cells and could be isolated in a soluble 
form by enzymatic cleavage and affinity chromatography. 
The V a and Vp domains appeared to be properly paired , since 
the scTCR bound to a conformation-dependent mAb, the 
superantigen SEB , and the proper MHC/peptide ligand . This 
design of scTCR offers an alternative to the two-chain design 
of soluble TCRs and has several advantages. (/) The sc design 
avoids the low-efficiency dimerization process which may be 
the limiting step in the assembly of TCR heterodimers from 
a and subunits made in Escherichia coli. The sc design 
therefore allows efficient expression of the recombinant 
protein in quantities suitable for structural analysis »and for 
some diagnostic or therapeutic applications, (ii) The design 
avoids the problems associated with the unstable expression 
of the human TCR a chain that have hindered efficient 
expression of human TCR molecules in eukaryotic cells. (Hi) 
The sc design may allow the construction of TCR phage 
display libraries similar to those made for sc antibodies (28, 
31). scTCR phage libraries may be. powerful tools for the 
isolation of TCRs with defined specificities and/or high 
1 affinity for selective targeting of malignant and virally in- 
fected cells and for analyzing the interactions among TCR, 
MHC/peptide complexes, and superantigens. 

Unlike antibodies, separately expressed V domains of TCR 
a and /3 chains have not been reconstituted to form het- 
erodimers (ref . 32; K. L. Hilyard, personal communication). 
However, despite this success in producing the three-domain 
scTCR, a two-domain scTCR with detectable CI reactivity 
could not be produced either in eukaryotic cells or in bacte- 
ria. The presence of the Cp domain (or part of the domain) 
may be required for the proper folding and/or stabilization of 
a scTCR molecule. In any event, the three-domain design 
provides a general means for the efficient production of 
functional scTCR molecules. 
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It has not been possible to directly test these possibilities 
because the generation of TCRs with affinities above 10 7 M" 1 
has not been accomplished. In addition to allowing a kinetic 



T cell receptors (TCRs) exhibit genetic and structural diversity 
similar to antibodies, but they have binding affinities that are 
several orders of magnitude lower. It has been suggested that TCRs 
undergo selection in vivo to maintain lower affinities. Here, we 
show that there is not an inherent genetic or structural limitation 
on higher affinity. Higher-affinity TCR variants were generated in 
the absence of in vivo selective pressures by using yeast display 
and selection from a library of Va CDR3 mutants. Selected mutants 
had greater than 100-fold higher affinity (K D ~ 9 nM) for the 
peptide/MHC ligand while retaining a high degree of peptide 
specificity. Among the high-affinity TCR mutants, a strong prefer- 
ence was found for CDR3a that contained Pro or Gly residues. 
Finally, unlike the wild-type TCR, a soluble monomeric form of a 
high-affinity TCR was capable of directly detecting peptide/MHC 
complexes on antigen-presenting cells. These findings prove that 
affinity maturation of TCRs is possible and suggest a strategy for 
engineering TCRs that can be used in targeting specific peptide/ 
MHC complexes for diagnostic and therapeutic purposes. 

T cells recognize a foreign peptide bound to the MHC product 
through the a/3 heterodimeric receptor. The T cell receptor 
TCR) repertoire has extensive diversity created by the same 
,ene rearrangement mechanisms used in antibody heavy- and 
light-chain genes (1). Most of the diversity is generated at the 
junctions of V and J (or diversity, D) regions that encode the 
complementarity-determining region three (CDR3) of the a and 
/3 chains (2). However, TCRs do not undergo somatic point 
mutations as do antibodies, and perhaps not coincidentally, 
TCRs also do not undergo the same extent of affinity maturation 
as antibodies. TCRs appear to have affinities that range from 10 5 
to 10 7 M -1 whereas antibodies have affinities that range from 10 5 
to 10 10 M" 1 (3, 4). 

Whereas the absence of somatic mutation in TCRs may be 
associated with lower affinities, it has also been argued that there 
is not a selective advantage for a TCR to have higher affinity 
(5-7)^ In fact, the serial-triggering (6) and kinetic proofreading 
(7) models of T cell activation both suggest that very slow 
off-rates (associated with higher affinity) would be detrimental 
to the signaling process. On the other hand, the fastest off-rates 
that have been measurable have been associated with altered 
pMHC that exhibits antagonist activity (8-11). Whereas the 
narrow range of natural TCR affinities has provided some 
evidence for the relationships between off-rates and agonist/ 
antagonist activity, there are also examples that appear to be 
inconsistent with these hypotheses (12, 13). 

There are other possible explanations for why the T cell system 
maintains relatively low TCR:pMHC affinities in vivo. Peptides 
bound within the MHC groove display limited accessible surface 
(14), which may in turn limit the amount of free energy that can 
be generated in the interaction. On the other hand, raising the 
affinity of a TCR by directing the free energy toward the MHC 
lelices would presumably lead to thymic deletion during nega- 
tive selection (15). Even if such higher-affinity TCR could escape 
thymic deletion, they would likely not maintain the peptide 
specificity required for T cell responses. 



basis of T cell triggering, high-affinity TCRs could be used to 
more easily explore the role of peptide in pMHC recognition, 
and as quantitative probes for the expression of pMHC on 
various target cells. Because in vivo selection schemes have not 
yielded TCRs with the intrinsic binding affinities of affinity- 
matured antibodies, in this report, we have used an in vitro 
method for the directed evolution of high-affinity TCRs. The 
method relies on the expression of a library of mutant single- 
chain ( V/3-linker-Va) TCRs on the surface of yeast, as a fusion 
to the surface protein Aga-2 (16, 17). Our previous'studies have 
shown that the yeast display system could be used to engineer 
variants of the 2C single-chain TCR (scTCR) that were more 
thermally stable and secreted at higher levels (17, 18). The 
stability mutants were isolated by subjecting the entire TCR gene 
to random mutagenesis and selecting for increased surface levels 
with anti-TCR antibodies (17). The mutations that increased 
stability resided at the Va: Vj3 interface or on the outside surface 
of in a region not involved in pMHC binding. To isolate TCR 
with higher affinity for pMHC, in the present study, we mutated 
only the CDR3a loop, which is at the center of the pMHC- 
binding site (19). Our efforts were guided by previous findings 
that this region contributed minimal binding free energy to the 
interaction of the 2C TCR with the pMHC ligand QL9/L d (20), 
suggesting that productive interactions might be improved by 
focusing on this region. Remarkably, selection from a relatively 
small library (10 5 mutants) yielded many different TCRs with up 
to 100-fold increased affinity for QL9/L d . The high-affinity 
TCRs retained a high degree of peptide specificity although 
there was some variation in fine specificity among the mutants. 
These findings suggest that the in vitro evolution process de- 
scribed here can be used to isolate TCRs with specificities that 
one defines by selection with appropriate pMHC ligands. 

The high-affinity receptors in this study were derived by 
variation at the VJ junction, the same process that operates very 
effectively in vivo through gene rearrangements in T cells (2). 
The fact that we could readily isolate a diverse set of high-affinity 
TCR in vitro indicates that there is not a genetic or structural 
limitation to high-affinity receptors. This supports the view that 
inherently low affinities of TCRs found in vivo are caused by a 
lack of selection for higher affinity and perhaps a selection for 
lower affinity (5-7). Finally, the high-affinity TCR were used in 
monomeric form to detect pMHC on the surface of target cells, 
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Fig. 1. Flow cytometric analysis of yeast cells that express wild-type and 
mutant 2C TCR on their surface. Yeast cells displaying wild-type (17) and 
mutant (m6 and ml 3) scTCR were stained with anti-V/38 antibody F23.2 
(120 nM), the specific alloanttgenic peptide-MHC, QL9/L d /lg (40 nM), or a null 
peptide MCMV/L d /!g (40 nM). The peptides used in this study were QL9 
(QLSPFPFDL), MCMV (YPHFMPTNL), and p2Ca (LSPFPFDL). Binding was de- 
tected by FITC-conjugated goat anti-mouse IgG F(ab')2 and analyzed by flow 
cytometry. The negative population (e.g., seen with F23.2 staining) has been 
observed for all yeast-displayed proteins and is thought to be caused by cells 
at a stage of growth or induction that are incapable of expressing surface 
fusion protein (16, 17, 27). 

indicating that soluble forms of the TCR selected with the yeast 
display system can serve as probes for tumor-associated pMHC 
or other T cell-specific ligands. 

Materials and Methods 

Library Construction. The 2C single-chain TCR (scTCR) used as 
the scaffold for directed evolution (T7) contained six mutations 
(0G17E, /3G42E, 0L81S, aL43P, aW82R, and aI118N) that 
have been shown to increase the stability of the TCR but still 
allow pMHC binding (E.V.S., KD.W., and D.M.K., unpublished 
results; and ref. 18). Mutagenic PCR of the T7 scTCR VaCDR3 
was performed by using an AGA-2-specific upstream primer and 
a degenerate downstream primer 5 ' - CTTTTGTGCCGGATC- 
CAAATGTCAG(SNN) 5 GCTCACAGCACAGAAGTACACG- 
GCCGAGTCGCTC-3'. Underlined bases indicate the positions 
of silent mutations introducing unique BamYLl and Eagl restric- 
tion sites. The purified PCR product was digested with Ndel and 
BamHL and ligated to Ndel-BamKL-digested T7/pCT302 (16- 
18). The ligation mixture was transformed into DH10B electro- 
competent Escherichia coli (GIBCO/BRL), and transformants 
were pooled into 250-ml LB supplemented with ampiciliin at 
100 iLg/ml and grown overnight at 37°C. Plasmid DNA was 
transformed into the yeast strain EBY100 by the method of Gietz 
and Schiestl (21). 

Cell Sorting. The yeast library (22) was grown in 2% dextrose/ 
3.67% yeast nitrogen base/1% Casamino acids (Difco) at 30°C 
to an ODfioo = 4.0. To induce surface scTCR expression, yeast 
were pelleted by centrifugation, resuspended to an OD600 = 1.0 
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Rg. 2. Structure and sequences of the 2C TCR CDR3a. iA) X-ray crystallo- 
graphic structure of the 2C/dEV8/K b complex with CDR3a aa highlighted. Five 
residues of the 2C VaCDFG that were randomized by PCR with a degenerate 
primer are shown in red. The adjacent CDR3 residues, Ser-93 and Leu-104 
shown in blue, were retained in the yeast display library because they have 
been shown to be important in pMHC binding (1 7, 1 8, 20). (B) Alignment of aa 
sequences of mutant scTCRs isolated by yeast display and selection with 
QL9/L d . Display plasmids were isolated from yeast clones after selection and 
sequenced to determine CDR3<i sequences. Mutants ml, m2, m3, m4, m10, 
andml 1 were isolated after the third round of sorting. All other mutants were 
isolated after the fourth round of sorting. 

in 2% galactose/0.67% yeast nitrogen base/1% Casamino acids, 
and incubated at 20°C for ~24 h. In general, ~10 7 cells per tube 
were incubated on ice for 1 h with 50 /il of QL9/L d /IgG dimers 
(23) diluted in PBS (pH 7.4) supplemented with 0.5 mg/ml BSA. 
After incubation, cells were washed and labeled for 30 min with 
FITC-conjugated goat anti-mouse IgG F(ab')2 (Kirkegaard & 
Perry) in PBS (pH 7.4) supplemented with 0.5 mg/ml BSA. 
Yeast were then washed and resuspended in PBS (pH 7.4) 
supplemented with 0.5 mg/ml BSA immediately before sorting. 
Cells exhibiting the highest fluorescence were isolated by using 
a Coulter 753 bench fluorescence-activated cell sorter. After 
isolation, sorted cells were expanded in 2% dextrose/0.67% 
yeast nitrogen base/1% Casamino acids and induced in 2% 
galactose/0.67% yeast nitrogen base/1% Casamino acids for 
subsequent rounds of selection. A total of four sequential sorts 
were performed. The concentrations of QL9/L d /IgG dimers 
used for staining were 50 pLg/ml for sorts one to three and 
0.5 /x.g/ml for the final sort. The percentages of total cells 
isolated from each sort were 5.55, 2.68, 2.56, and 0.58%, 
respectively. Aliquots of sorts three and four were plated on 2% 
dextrose/0.67% yeast nitrogen base/1% Casamino acids to 
isolate individual clones which were analyzed by flow cytometry 
by using a Coulter Epics XL instrument. 

Soluble scTCR Production. The T7 and m6 scTCR genes were 
excised from pCT302 Nhel-Xhol and ligated into Nhel-Xhol 
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digested pRSGALT, a yeast expression plasmid (18). Ligations 
were transformed into DH10B electrocompetent E. coli 
(G1BCO/BRL). Plasmid DNA was isolated from bacterial cul- 
tures and transformed into the Saccharomyces cerevisiae strain, 
BJ5464 (a ura3-52 trpl \eu2Ll his3b200 pep4:JJIS3 prblM.6R 
canl GAL) (18). Yeast clones were grown in 1 liter of 2% 
dextrose/0.67% yeast nitrogen base/1% Casamino acids/20 
mg/liter Trp for 48 h at 30°C To induce scTCR secretion, cells 
were pelleted by centrifugation at 4,000 X g, resuspended in 1 
liter of 2% galactose/0.67% yeast nitrogen base/1% Casamino 
acids/20 mg/1 Trp supplemented with 1 mg/ml BSA, and 
incubated for 72 h at 20°C. Culture supernatants were harvested 
by centrifugation at 4,000 X g, concentrated to «50 ml, and 
dialyzed against PBS (pH 8.0). The six His-tagged scTCRs were 
purified by native nickel affinity chromatography [Ni-NTA 
Superflow, Qiagen (Chatsworth, CA); 5 mM and 20 mM imi- 
dazole (pH 8.0) wash; 250 mM imidazole elution] (18). 

Cell-Binding Assays. The binding of soluble scTCRs to QL9/L d 
was monitored in a competition format as described (20, 24). 
Peptide-loaded T2-L d cells (3 X 10 5 per well) were incubated for 
1 h on ice in the presence of 125 Mabeled anti-L d Fabs (30-5-7) 
and various concentrations of scTCRs. Bound and unbound 
[ 125 I] 30-5-7 Fabs were separated by centrifugation through olive 
oil/dibutyl phthalate. Inhibition curves were constructed to 
determine inhibitor concentrations yielding 50% maximal inhi- 
bition. Dissociation constants were calculated by using the 
formula of Cheng and Prusoff (25). To monitor direct binding of 
scTCRs to cell-bound pMHC, peptide-loaded T2-L d cells (5 X 
10 5 per tube) were incubated for 40 min on ice with biotinylated 
soluble scTCRs followed by staining for 30 min with streptavi- 
din-phycoerythrin (PharMingen). Cellular fluorescence was de- 
tected by flow cytometry. 

Results and Discussion 

To examine if it is possible to generate higher- affinity TCR that 
would retain peptide specificity, we subjected a TCR to a process 



of directed in vitro evolution. Phage display (26) has not yet 
proven successful in the engineering of single-chain TCRs 
(scTCRs, V/Minker-Va) despite the extensive structural simi- 
larity between antibody and TCR V regions. However, we 
recently showed that a scTCR could be displayed on the surface** 
of yeast (17) in a system that has proven successful in antibody 
engineering (16, 27). A temperature-stabilized variant (T7) (18) 
of the scTCR from the cytotoxic T lymphocyte clone 2C was used 
in the present study. Cytotoxic T lymphocyte clone 2C recog- 
nizes the alloantigen L d with a bound octamer peptide called 
p2Ca, derived from the enzyme 2-oxogIutarate dehydrogenase 

(28) . The nonameric variant QL9 is also recognized by cytotoxic 
T lymphocyte 2C, but with 10-fold higher affinity by the 2C TCR" 

(29) . Alanine scanning mutagenesis showed that the CDR3a 
loop contributed minimal free energy to the binding interaction 
(20), even though structural studies have shown that CDR3a of 
the 2C TCR is near the peptide and it undergoes a conforma- 
tional change to accommodate the pMHC complex (19). Thus, 
we focused our mutagenesis efforts on five residues that form the 
CDR3a loop. 

A library of 10 5 independent TCR-CDR3a yeast mutants was 
subjected to selection by flow cytometry with a fluorescently 
labeled QL9/L d ligand (23). After four rounds pf sorting and 
growth, 15 different yeast colonies were examined for their 
ability to bind the ligand, in comparison to the scTCR variant T7, 
which bears the wild-type CDR3a sequence (Fig. 1 and data not 
shown). The anti-V)38.2 antibody F23.2 which recognizes resi- 
dues in the CDR1 and CDR2 was used as a control to show that 
wild-type scTCR-T7 and scTCR mutants (m6 and ml3, in Fig. 
1, and others, data not shown) each had approximately equiv- 
alent surface levels of the scTCR (Fig. 1). In contrast, the soluble 
QL9/L d ligand bound very well to each mutant yeast clone but 
not to wild-type scTCR-T7. The MCMV/L d complex, which is 
not recognized by cytotoxic T lymphocyte clone 2C, did not bind 
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Fig. 4. QL9/L d binding by soluble scTCRs. T2-L d cells loaded with QL9 were 
incubated with ^-labeled anti-L d Fab fragments (30-5-7) and various 
concentrations of unlabeled Fab (□), scTCR-T7 <■), or mutant scTCR-m6 {•) 
Bound and unbound P"l]30-5-7 Fab fragments were separated by cen- 
tnfugation through olive oil/dibutyl phthalate. Binding of 12S l-labeled 
anti-L d Fab fragments to T2-L d cells loaded with the control peptide MCMV 
was not inhibited even at the highest concentrations of scTCRs (data not 
shown). 



to the scTCR mutants or wild-type scTCR-T7, indicating that the 
scTCR mutants retained peptide specificity. 

The CDR3a sequences of the 15 mutants all differed from the 
2C TCR (Fig. 2). It was readily apparent (and confirmed by a 
blast alignment algorithm) that the sequences could be aligned 
into two motifs. One motif contained Gly in the middle of the 
five residue stretch whereas the other motif contained three 
tandem Pro. Evidence that all three Pro are important in 
generating the highest affinity site is suggested by results with 
mutant mil. Mutant mil contained only two of the three Pro 
and exhibited reduced binding compared with the triple-Pro 
mutants (data not shown). The Gly-containing mutants ap- 
peared to have preferences for positive-charged residues among 
the two residues to the carboxyl side (7/9) and aromatic and/or 
positive-charged residues among the two residues to the amino 
side (4/9 and 5/9). The selection for a glycine residue at position 
102 in the motif may indicate that the CDR3a loop required 
conformational flexibility around this residue to achieve in- 
creased affinity. This is consistent with the large (6-A) confor- 
mational difference observed between the CDR3a loops of the 
Uganded and unliganded 2CTCR (19). It is also interesting that 
Gly is the most common residue at the V(D)J junctions of 
antibodies and the presence of a Gly has recently been associated 
with increased affinity in the response to the (4-hydroxy-3- 
nitrophenyl) acetyl hapten (30). 

In contrast to the isolates that contain Gly, the selection for 
a Pro-rich sequence at the tip of the CDR3a loop may suggest 
that these TCR exhibit a more rigid conformation that confers 
higher affinity. The x-ray crystallographic structures of a germ- 
line antibody of low affinity compared with its affinity-matured 
derivative showed that the high-affinity state was associated with 
stabilization of the antibody in a configuration that accommo- 
dated the hapten (31). Similarly, the NMR solution structure of 
a scTCR that may be analogous to the germ-line antibody 
showed that the CDR3a and j3 loops both exhibited significant 
mobility (32). Recent thermodynamic studies of TCR:pMHC 
interactions have also suggested the importance of conforma- 
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Fig. 5. Flow cytometric analysis of the binding of scTCR/biotin to cell 
surface peptide/MHC Peptide-loaded T2-L d cells were incubated with 
biotmylated m6 scTCR (-0.3 M M) or T7 scTCR (- 1 .6 M M) scTCR followed by 
streptavidin-PE and analyzed by flow cytometry. {A) Flow cytometry his- 
tograms of T2-L d cells loaded with QL9 (unshaded), p2Ca (light shade), or 
MCMV (dark shade) and stained with m6 scTCR/biotin. (B) Mean fluores- 
cent units (MFU) of T2-L d cells loaded with QL9, p2Ca, or MCMV and stained 
with either secondary SA-PE only T7 scTCR/biotin + SA-PE, or m6 scTCR/ 
biotin + SA-PE. 



tional changes in binding (33, 34). Structural and thermodynamic 
studies of the TCR mutants that we report here should allow us 
to examine if the two CDR3a motifs (Gly- versus Pro-rich) might 
differ in the mechanism by which they confer higher affinity. 

Although the scTCR mutants did not bind the null peptide/L d 
complex MCMV/L d , it remained possible that the increase in 
affinity might be accompanied by a change in fine specificity. To 
examine this issue, we used QL9 position 5 (Phe) peptide 
variants that have been shown previously to exhibit significant 
differences in their binding affinity for the wild-type 2C TCR 
(35). The binding of these pMHC to various TCR mutants on the 
yeast surface and clone 2C were measured by flow cytometry. As 
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shown in Fig. 3, the native TCR on 2C is capable of binding QL9 
variants that contain either Tyr or His at position five but not 
--. Glu. Each of the higher-affinity TCR mutants retained their 
ability to recognize the conserved Tyr-substituted peptide and 
they were likewise incapable of recognizing the Glu-substituted 
peptide. However, several of the TCR mutants (m6 f m9, and 
ml3) bound to the His-substituted peptide (albeit to different 
extents) whereas other mutants (m5, m7, and ml4) did not bind 
the peptide. Thus, the CDR3a loop can influence the peptide 
fine specificity of recognition but it is not the only region of the 
TCR involved. The effect on peptide specificity could be through 
direct interaction of CDR3a residues with the variant peptide, as 
suggested from earlier studies involving CDR3-directed selec- 
tions (36, 37). Alternatively, binding free energy may be directed 
at peptide-induced changes in the L d molecule itself. The latter 
possibility is perhaps more likely in the case of the 2C 
TCR:QL9/L d interaction, because position five of QL9 has been 
predicted to point toward the L d groove (35, 38). The fine- 
specificity analysis also shows that it is possible to engineer TCRs 
with increased, or at least altered, specificity for cognate pep- 
tides. Thus, directed evolution of only a short region (CDR3a) 
of a single TCR allows the design of TCR variants with altered 
peptide-binding specificities. 

To determine the magnitude of the affinity increases associated 
with a selected CDR3a mutant, the wild-type T7 scTCR and the m6 
scTCR were expressed as soluble forms in a yeast secretion system. 
Purified scTCR preparations were compared for their ability to 
block the binding of a 125 I-labeled anti-L d Fab fragments to QL9 or 
MCMV loaded onto L d on the surface of T2-L d cells. As expected, 
neither T7 nor m6 scTCR were capable of inhibiting the binding of 
l25 I-Fab fragments to T2-L d cells up-regulated with the MCMV 
peptide (data not shown). However, both T7 and m6 were capable 
of inhibiting the binding of anti-L d Fab fragments to QL9 /L d (Fig. 
4). The m6 scTCR variant was as effective as unlabeled Fab 
fragments in inhibiting binding, whereas the T7 scTCR was 160-fold 
less effective (average of 140-fold difference among four indepen- 
dent titrations). The Ko values of the scTCR for QL9/L d were 
calculated from the inhibition curves to be 1.5 fjM for T7 and 9.0 
nM for m6. The value for T7 is in close agreement with the 3.2 /iM 
K D reported for the 2C scTCR (39). These findings show that the 
yeast system, combined with CDR3a-directed mutagenesis, is 
capable of selecting mutants with 100-fold higher intrinsic binding 
affinities for a pMHC ligand. 

If the soluble scTCR has a high affinity for its pMHC ligand, 
then it may be useful, like antibodies, as a specific probe for 

1. Tonegawa, S. (1988) BiascL Rep. 8, 3-26. 

1 Davis, M. M. & Bjorkman, P. J. (1988) Nature (London) 334, 395-402. 

3. Eisen, H R, Sykulev, Y. & Tsomides, T, J. (1996) Adv. Protein Own. 49, 1-56. 

4. Davis, M. M„ Boniface, J. J., Reich, Z M Lyons, D., Hampl, J., Arden, B. & 
Chien, Y. (1998) AnniL Rev. Immunol. 16, 523-544. 

5. Sykulev, Y., Cohen, R. J. & Eisen, H. N. (1995) Proc. Natl. Acad. ScL USA 92, 
11990-11991 

6. Valitutti, S., Muller, S., Cella, M., Padovan, E. & Lanzavecchia, A. (1995) 
Nature (London) 375, 148-151. 

7. Rabinowitz, J. D., Beeson, C, Lyons, D. S-, Davis, M. M. & McConnell, H. M. 
(1996) Proc Natl Acad. Set USA 93, 1401-1405. 

8. Rabinowitz, J. D., Beeson, C Wulfing, C, Tate, K., Allen, P. M, Davis, M. M 
& McConnnell, H. M. (1996) Immunity 5, 125-135. 

9. Sloan-Lancaster, J. & Allen, P. M. (1996) Annu. Rev. Immunol 14, 1-27. 

10. Alam, S. M., Travers, P. J., Wung, J. L., Nasholds, W., Redpath, $„ Jameson, 
S. C & Gascoigne, R. J. (1996) Nature (London) 381, 616-620. 

11. Kersh, G. J., Kerch, E. N., Fremont, D. H. & Allen, P. M. (1998) Immimity 9 t 
817-826. 

12. Al-Ramadi, B. K., Jelonek, M. T., Boyd, U F., Margulies, D. H. & Bothwell, 
A. L. M. (1995) /. Immunol. 155, 662-673. 

13. Sykulev, Y„ Vugmeyster, Y., Brunmark, A., Ploegh, H. L & Eisen, H. N. (1998) 
Immunity 9, 475-483. 

14. Bjorkman, P. J. (1997) Celi 89, 167-170. 

15. Bevan, M J. (1997) Immunity 7, 175-178: 

16. Boder, E. T. & Wittrup. K. D. (1997) Nat. Biotechnot. 15, 553-557. 



cell surface-bound antigen. To test this possibility, the soluble 
T7 and m6 scTCR were biotinylated and the labeled scTCR 
were incubated with T2-L d cells loaded with QL9, p2Ca, or 
MCMV. The m6 scTCR, but not the T7 scTCR, yielded easily 
detectable staining of the T2 cells that had been incubated with 
QL9 or p2Ca (Fig. 5 A and B). It is significant that p2Ca-up- 
regulated cells were also readily detected by m6 scTCR, 
because p2Ca is the naturally processed form of the peptide 
recognized by the alloreactive clone 2C and it has an even 
lower affinity than the QL9/L d complex for the 2C TCR (29). 
However, it remains to be determined if the levels of pMHC 
derived from endogenous antigen processing are sufficient to 
allow detection by using soluble TCR as probes. It is reason- 
able to predict that, in some cases, the level will be too low to 
distinguish from background by using standard flow cytometry 
procedures. 

The high-affinity receptors described in our study were de- 
rived by variation at the VJ junction, the same process that 
operates very effectively in vivo through gene rearrangements in 
T cells (2). The fact that we could readily isolate a diverse set of 
high-affinity TCR in vitro indicates that there is not a genetic or 
structural limitation to high-affinity receptors. This supports the 
view that inherently low affinities of TCRs found in vivo are 
caused by a lack of selection for higher affinity and perhaps a 
selection for lower affinity (5-7). In this respect, the higher- 
affinity TCRs now provide the reagents for directly testing 
hypotheses about the effects of affinity on T cell responses (4-7). 
It is interesting to note that similar arguments have been used to 
suggest that the kinetic properties of antibodies may also set an 
in vivo "affinity ceiling," above which there may not be a selective 
advantage to B cells (40). 

In addition to their utility for testing T cell responses, 
high-affinity TCRs can be engineered like antibodies to yield 
high-affinity, antigen-specific probes. Soluble versions of 
the high-affinity receptor can directly detect specific peptide/ 
MHC complexes on cells (Fig. 5). Thus, these engineered 
proteins have potential, for example, as tumor cell diagnostics, 
or on conjugation with cytotoxins, potential agents for cancer 
therapy. 
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